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I.  INTRODUCTION 

Codes  for  correcting  large  multiple  random  errors  are  not  used 
extensively  in  practical  data  transmission  systems  because  of  equipment 
complexity.  Threshold  decoding  is  a  method  of  error  correction  which  is 
especially  suitable  for  machine  implementation  because  the  logical  circuit  to 
realize  the  threshold  decoding  is  usually  very  simple.  Finding  cyclic  codes 
that  can  be  decoded  by  threshold  logic  becomes  important. 

Threshold  decoding  of  block  codes  was  introduced  by  Reed  who  devised 
a  decoding  the  class  of  codes  discovered  by  Muller  [l4]  .  Massey 

devised  many  threshold  decoding  algorithms  for  recurrent  codes  as  well  as 
block  codes.  His  book  [l3]  "Threshold  decoding"  includes  a  comprehensive 
discussion  of  the  work  on  threshold  decoding  for  block  codes  before  1963. 
Rudolph's  [l6]  threshold  decoding  algorithm  differs  from  previous  algorithms 
in  that  the  estimates  (parity  checks)  are  not  necessarily  orthogonal.  His 
projective  geometry  codes  are  specified  in  terms  of  parity  check  matrices. 

The  determination  of  the  number  of  check  digits  lies  on  the  determination  of 
the  rank  of  the  parity  check  matrix  which  is  often  not  easy,  especially  when 
the  code  length  becomes  large.  It  is  therefore  necessary  to  develop  a  theory 
to  described  his  code  in  terms  of  roots  of  the  generator  polynomial.  The 
generator  polynomial  can  then  be  obtained  by  multiplying  irreducible  poly¬ 
nomials  found  from  a  mathematical  table  such  as  the  one  from  reference  [l4j  . 
Description  of  the  code  in  term  of  the  generator  polynomial  is  essential  in 
simplifying  implementation. 

Weldon  [l?]  has  discoved  a  class  of  cyclic  codes  based  on  difference 
sets.  Graham  and  MacWilliams  [43b*ve  studied  the  number  of  information 
digits  of  difference-set  cyclic  codes.  The  class  of  difference -set  cyclic 
codes  is  a  subclass  of  Rudolph’s  projective  geometry  codes.  Kasami  [b] 
has  shown  that  Reed-Muller  codes  are  equivalent  to  primitive  cycles  codes 


with  an  overall  parity  check  bit  added.  The  cyclic  property  of  these  codes 
simplifies  the  decoding  algorithm  of  the  Reed -Muller  codes  and  also  makes 
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them  more  tractable  mathematically.  These  codes  can  now  be  described 
in  terms  of  the  roots  of  the  generator  polynomials  which  suggests  a  natural 
generalization  to  non -binary  cases.  Weldon  has  investigated  non -primitive 
Reed-Muller  codes  [lfi]  which  include  as  subclasses  the  primitive  Reed- 
Muller  codes  and  difference -set  codes  and  has  found  a  decoding  algorithm  for 
them.  The  non -primitive  Reed-Muller  code  is  a  subcode  of  the  Rudolph’s  pro¬ 
jective  geometry  code,  and  the  decoding  algorithm  for  the  non -primitive  Reed- 
Muller  code  is  applicable  to  the  Rudolph’s  projective  geometry  code.  In  this 
thesis,  two  related  classes  of  codes  derived  from  Euclidean  geometries  are 
presented.  We  call  them  Euclidean  geometry  codes  and  modified  Euclidean 
geometry  codes.  The  generator  polynomial  of  a  Euclidean  geometry  code 
is  (  x  -  1)  times  that  of  the  corresponding  modified  Euclidean  geometry  code. 
The  code  symbols  of  these  codes  can  be  chosen  from  any  field  containing  a 
particular  prime  field  GF  (p).  The  dual  of  a  Euclidean  geometry  cotie  over 
GF  (q)  is  a  subcode  of  a  q-ary  (q  is  a  power  of  prime  p)  modified  Reed-Muller 
code  which  contains  the  parity  checks  required  to  make  majority  voting.  We 
derive  a  class  of  codes  from  projective  geometries  in  terms  of  the  roots  of 
generator  polynomials.  The  discovery  of  these  codes  are  independent  of 
Weldon's  work  on  non -primitive  Reed-  Muller  code  [is]  .  These  codes  are 
better  than  the  corresponding  non -primitive  Reed-Muller  codes  in  general 
because  they  have  more  information  digits  and  have  the  same  error  -correcting 
ability  by  L-step  orthogonalization  procedure.  Theoretically  Rudolph’s 
projective  geometry  codes  contain  the  newly  established  projective  geometry 
codes  as  subcodes.  So  far  we  have  not  found  any  case  in  which  they  are 
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different.  The  codes  from  finite  geometries  including  Euclidean  geometries 
and  projective  geometries  are  subcodes  of  BCH  codes  of  the  same  length. 

Thus  these  codes  are  not  as  efficient  as  BCH  codes  in  general.  However, 
for  the  most  interesting  values  of  code  length  and  rare  the  difference  between 
finite  geometry  codes  and  BCH  codes  is  slight.  In  an  attempt  to  iind  a 
general  threshold  decoding  algorithm  for  BCH  codes,  we  found  that  a  class  of 
BCH  codes  cannot  be  L-step  orthogonalizcd.  However,  all  codes  (including 
some  BCH  codes)  whose  extension  codes  are  invariant  under  a  doubly  transitive 
permutation  group  can  be  decoded  by  one -step  threshold  decoding.  Some  of  the 
BCH  codes  turned  out  to  be  comparable  with  codes  related  to  finite  geometries 
by  this  method. 

Recently,  coding  has  been  applied  to  information  retrieval.  Kautz 
and  Singleton  [9j  have  proposed  using  ze ro -false -drot  codes  for  information 
retrieval.  Chien  and  Frazer  [3]  have  derived  methods  for  document 
retrieval  from  algebraic  coding  theory.  Two  new  classes  of  zero-false  -drop 
codes  have  been  derived  from  finite  geometries  These  codes  provide  more 
useful  parameters  thin  the  previous  ones.  Investigation  has  been  made  of 
the  use  of  error -correcting  codes  for  information  retrieval.  Several 
interesting  results  have  been  obtained. 

The  material  in  this  paper  is  arranged  as  follows.  In  chapter  2. 
we  present  the  codes  derived  from  Euclidean  geometries.  In  chapter  3.  the 
polynomial  version  of  projective  geometry  codes  is  given.  In  chapter  4  we 
investigate  the  threshold  decoding  of  cyclic  codes,  including  BCH  codes  and 
finite  geometry  codes.  In  chapter  5.  the  application  of  coding  theory  to  in¬ 
formation  retrieval  is  presented.  Finally  we  have,  in  chapter  0.  the  con 
elusion  and  suggestions  for  promising  areas  of  future  research 
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II.  CODES  DERIVED  FROM  EUCLIDEAN  GEOMETRIES 
2. 1  Preliminaries 

Codes  derived  from  Euclidean  geometries  are  closely  related  to  Reed- 
Muller  codes,  we  first  introduce  some  background  concerning  Euclidean 
geometries  and  then  indicate  the  connection  between  Euclidean  geometries  and 
Reed-Muller  codes- 

Let  a  be  a  primiti  ve  element  in  GF(qm)  -  As  the  elements 
a  *(i  *  0,  1 . .  m  -  1)  are  linearly  independent  over  GF(q),  we  may  write 


m-1 

vij a  *  •  0<j<qm-2 

i=0 


where  v.  .  is  in  GF(c). 

ij 

Let  G  be  a  matrix  defined  as 


(2.1) 


G  - 


0 


1° 


V(m-1)0  V(m-1)  1  *  *  V{m-lXqm*2) 


10 

V11 

•  *  Vl(qm-2) 

00 

V01 

*  '  V0(qm-2) 

(2.2) 


The  first  column  of  G  corresponds  to  the  zero  element  in  GF(qm)  and  the 

jtn  column  (2  <  j  <  qm  )  corresponds  a  *  ^  in  GF(qm). 

We  can  associate  each  column  of  the  matrix  in  equation  (2.  2)  a  point 

in  a  Euclidean  geometry  of  m -dimension  over  GF(q),  denoted  as  EG(m,  q)  [2], 

2  am-2 

EG(m,q)  consists  of  q11  points  0,  1,  a  ,  a  . a  ^  .  An 

u -dimensional  flat  of  EG(m,  q)  consists  of  qu  points  ajai+a2a2  +  ." 

+  a  a  +  y  where  a  (1  <  i  <  u)  are  elements  in  GF(qm)  and  are 
u  u  i  -  - 

linearly  independent  over  GF(q).  a^  a^. . a^  may  run  independently 
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over  GF(q)and  y  is  a  fixed  element  in  GF(qm)  [2].  These  qu  points  are 
solutions  to  the  m-u  linearly  independent  equations  of  m  unknowns.  A 
vector  associated  with  an  u-dimensional  flat  is  defined  to  be  a  vector  with 
1  (multiplicative  identify  in  GF(q) )  in  the  positions  corresponding  to  all 
points  in  this  flat  and  0  (additive  identity  in  GF(q))otherwise. 

Kasami  et  al.  [6]  has  defined  a  q-ary  v-th  order  modified  Reed- 
Muller  code  to  be  a  cyclic  code  whose  generator  polynomial  contains  the 
roots  a  “  for  all  positive  integers  h  such  that  the  weight  of  h  over  base  q 
or  the  "digit"  sum  over  the  real  field  of  q-ary  representation  of  h  is  greater 
than  zero  but  less  than  m(q-l)  -  v.  That  is,  if 
m-1 

h=  ^  hi<!1  :  0<h.<q-l 

i=0 

m-1 

w^(h)  *  ^  h.  .  ah  is  a  root  of  a  q-ary  modified  Reed- 

i*0 

Muller  code  provided 

m-1 

0<  wq(h)  *  h.  <,  m  (q  -1)  -  v  (2.3) 

i-0 

It  has  been  shown  [6]  that  a  q-ary  v-th  order  Reed -Muller  code  can 
be  obtained  by  adding  an  overall  parity  check  as  a  first  digit  to  every  code 
word  of  a  q-ary  v-th  order  modified  Reed-Muller  code.  It  is  well  known 
Cl 4)  that  a  binary  (m-u)-th order  Reed-Muller  code  contains  all  vectors 
associated  with  u-dimensional  flats.  From  now  on,  we  shall  use  the  term 
"an  u-dimensional  fiat"  instead  of  "the  vector  associated  with  an  u -dimen¬ 
sional  flat"  when  no  ambiquity  arises.  It  has  been  shown  [f>]  that  a  q-ary 
(m-u)  (q-l)-lh  order  Reed-Muller  code  contains  all  u -dimensional  flats  of 


■*. 
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EG(m,q)  passing  through  the  point  corresponding  to  the  zero  element  in 
GF(qm).  In  general,  a  q-ary  (m-u)  (q-l)-th  order  Reed-Muller  code 
contains  all  u-dimensional  flats  of  EG(m,q).  The  proof  can  be  given  by  an 
argument  similar  to  that  used  in  reference  [6]  . 

2.2  Euclidean  Geometry  Codes 

Let  q  be  equal  to  ps  where  p  is  a  prime  and  s  is  a  positive  integer. 

In  this  section,  we  are  going  to  present  a  cyclic  code  over  GF(p)  which  has 
as  parity  checks  all  u-dimensional  flats  of  EG(m,q)  with  their  first  digits 
deleted. 

From  the  preceding  section,  q-ary  (m-u)  (q-l)-th  order  modified 

Reed-Muller  code  contains  all  the  vectors  vu  obtained  by  deleting  the  first 

digits  of  the  vectors  associated  with  the  u-dimensional  flats.  From  equation 

(2. 3), the  generator  polynomial  g1(x)  of  a  q-ary  (m-u)  (q-l)-th  order 

h 

modified  Reed-Muller  code  contains  a  as  roots  for  h  satisfying 

0<  w^(h)<  u  (q-1)  (2.4) 

Let  vy(x)  be  the  polynomial  corresponding  to  v^.  vy(x)  ls  a  polynomial 
whose  coefficients  is  either  1  or  0  hence  v  (x)  can  ^  considered  as  a  polyr 
nomial  over  GF(p).  Since  v^x**)  is  equal  to  (vy(x)  )*\  vy(x)  contains  a  ^  as 
a  root  if  it  contains  a  ^  as  a  root. 

Let  be  a  code  over  GF(q)  whose  generator  polynomial  g2(x) 
contains  cr  ^  as  roots  for  h  satisfying  the  condition  (2.4). 

is  a  subcode  of  a  q-ary  (m-u)  (q-l)-th  order  modified  Reed-Muller  Code  and 
contains  ail  u-dimensional  flats  with  their  first  digits  deleted. 
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g2<x)  is  a  polynomial  over  GF(p).  We  now  show  that  the  cyclic  code 
CQ  over  GF(p)  with  g2<x)  as  its  generator  polynomial  also  contains  the  u- 
dimensional  flats  with  their  first  digits  deleted  as  code  words.  g2<x)  is  ^ 
least  common  multiple  of  the  minimal  polynomials  m.(x)  over  GF(p)  of  the 
roots  a*1  for  h  satisfying  the  condition  (2.4).  The  polynomial  vy(x) 
associated  with  an  u -dimensional  flat  is  divisible  by  any  such  m^x),  hence 
vu(x)  is  divisible  by  the  least  common  multiple  of  these  m.(x).  vy(x)  has  its 
coefficients  in  GF(p)  and  is  divisible  by  g2(x).  The  code  CQ  contains  all 
u-dimensional  flats  with  their  first  digits  deleted  as  code  words. 

hJ 

Let  C  be  the  dual  of  the  code  CQ.  g^(x)  contains  a  as  roots 
for  h  satisfying  the  condition  (2. 4).  The  reciprocal  polynomial  g*(x)  °f 

hn  ^ 

g2<x)  contains  a  K  as  roots  for  h  satisfying 


m(q-l)>  wq(h)>  (m-u)  (q-1) 


(2.5) 


The  generator  polynomial  gg(x)  of  the  code  C  contains  all  q  -1  -th  roots  of 

L 

unity  which  are  not  roots  of  g£(x).  a  are  roots  of  gg(x)  for  any  non- 
negative  integer  h  less  than  qm-l  and  satisfying  the  condition 


0<wq  (hpj)<(m-u)  (q-1)  ;  05i5s-l 


(2.6) 


Next  we  show  that  the  lower  bound  of  the  minimum  distance  of  the  code 


d  *  2  +  q  + _ +  qm  l* 


(2.7) 


This  can  be  achieved  by  showing  that  any  nonnegative  integer  h  less 
than  l  +  q  +  ...+qmu  satisfies  the  condition  (2.6).  Let 


in  a 

h-  Yj  hiq*  :  °lhi5q'1 


(2.8) 
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If  h.  0  for  all  1  such  that  0< i<  m-u,  then  h>  1  +  q  +  . . .  +qm  u 
Thus  for  any  nonnegative  integer  h  which  is  less  than  1  +  q  +  . . .  +  qm  u, 

h.  =  0  for  m-u+l<i<  m-1  and  (2.9) 

for  at  least  one  i  in  the  range  of  0<  i<  m-u 
For  any  h  satisfying  condition  (2.  9),  one  can  easily  verify  that 
Wq(hpJ)<  (m-u)  (q-1)  for  all  j.  Thus  the  generator  polynomial  of  the  code 
C  contains  1  +  q  +  . . .  +  qm  u  consecutive  roots.  The  minimum  distance 

of  this  code  is  at  least  2  +  q  + . +  qm  u. 

Since  C  and  Cq  are  dual  codes,  all  code  words  of  C  satisfy  the  parity 
checks  specified  by  u-dimensional  flats. 

We  shall  call  the  code  C  a  Euclidean  geometry  code  with  parameters 

g 

m,  u  ,  q.  The  code  exists  for  any  power  of  prime  q(q=*p  ,  where  p  is  a 
prime  and  s  is  a  positive  integer),  any  positive  integer  m  and  any  integer  u  fo* 
l<u<  m-1. 

Theorem  2.1:  A  Euclidean  geometry  code  with  parameter  m,  u,  q  has  code 
length  qm-l.  The  number  of  parity  checks  of  this  code  is  the  number  of 
nonnegative  integer  h  less  than  qm-lsuch  that  the  weight  of  the  integers 
hp^  (  0<  j<  8-1)  over  base  q  is  no  more  than  (m-u)  (q-1).  The  minimum 
distance  of  this  code  is  at  least  2  +  q  +  . . .  +  qm  U. 

The  mo^t  important  subclass  of  Euclidean  geometry  codes  is  binary 
codes.  That  is,  the  case  when  p  equals  to  2.  We  list  some  binary 
Euclidean  Geometry  codes  in  Table  2. 1.  The  entries  in  this  table  is  (n,  k. 
d)  where  n  is  the  code  length,  k  is  the  number  of  information  digits  and  d  is 
the  lower  bound  on  minimum  distance  ( d  ■  2  +  q  + - +  qm  u  ). 
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Table  2. 1  Binary  Euclidean  Geometry  Cyclic  Codes 


EG<m,q) 

u  =*  m-1 

u  =  m-2 

u  =  m-3 

u  =  m-4 

EG  (3,2) 

EG  (4,2) 

EG  (5,2) 

(7.3.4) 

(15.10.4) 

(31.25.4) 

(15.4.8) 

(31.15.8) 

(31,5,16) 

EG  (6,2) 

(63,56,4) 

(63,41,8) 

(63,21,16) 

(63,6,32) 

EG  (2,4) 

BG  (3,4) 

EG  (4, 4) 

(15.6.6) 

(63.47.6) 

(255.230. 6) 

(63.12.22) 

(255.126.22) 

(255,20,86) 

EG  (5,4) 

(1023,987,6) 

(1023,747,22) 

(1023,287,86) 

(1023,32,342) 

EG  (2,8) 

EG  (3,8) 

EG  (4, 8) 

(63.36.10) 

(511.447.10) 

(4095.3970. 10) 

(511. 138.74) 

(4095.2584.74) 

(4095,405,586) 

EG  (2, 16) 
EG  (3, 16) 

EG  (2,32) 

EG  (2,64) 

(255, 174, 18) 
(4095,3839,38) 

(1023,780,34) 

(4095,3366,66) 

(4095,1376,274) 

! 

The  first  four  rows  of  this  Table  are  codes  associated  with  binary 
Reed-Muller  codes  and  are  known.  (15,6)  code  is  a  BCH  code.  The  rest 
of  the  codes  seem  to  he  new. 
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2. 3  Decoding  of  Euclidean  Geometry  Codes 

In  this  section,  we  show  that  by  using  u-step  orthogonalization  pro¬ 
cedure  similar  to  the  Reed  decoding  algorithm  we  can  decode  the  Euclidean 
geometry  code  C  with  parameters  m,  u,  q  to  the  bound  on  the  minimun 
distance  d  [l8]. 

The  code  C  satisfies  the  u-dimensional  flat  parity  checks.  In  the  first 
step,  we  determine  u-1 -dimensional  flat  check  sums  from  u-dimensional 
flats.  In  the  u'  -th  step  (  1  u'  <  u)  we  determine  u-u'-  dimensional  flat  check 
sums  from  u-u'+  1 -dimensional  flats. 

The  following  theorem  is  essential  to  the  implementation  of  the  decoder. 

Theorem  2.2  :  For  a  given  u'  -1 -dimensional  flat  (  1  <  u'  <  u),  the  number  of 
u'  -dimensional  flats  containing  this  u'  -1 -dimensional  flat  is  1  +  q  +  . . . .  + 
qm  U  .  Any  two  of  these  u'  -dimensional  flats  has  no  points  in  common 
except  the  points  in  this  u'  -1  -dimensional  flat. 

Proof:  Consider  a  particular  u’-l -dimensional  flat  which  consists  of  q11  1 

points  a  j  +  a2a  2  +  ’  ‘  +  au*  -1 a  u'  -1 '  °  i^  5  *  ~  u'  are  ^near^Y 

independent  points,  a^  (1  <  i  <u’-l)  may  run  independently  over  GF(q). 

The  u' -dimensional  flat  containing  this  u’ -1 -dimensional  flat  consists  of  the 

points  +  +\-lV-l  + W  •  °l(1i 

linearly  independent  points.  a^,  run?  over  all  elements  in  GF(q).  The 

number  of  choice  of  a  ,  is  qm  -  q  In  a  fixed  u'  -dimensional  flat, 

u 

the  number  of  choice  of  a  ,  is  qu  -  qU  *.  The  number  of  distinct 

u 

u  -dimensional  flats  containing  this  u' -1 -dimensional  flat  is  (qm-qU  *)/ 

(qu  -  qu  1  )  or  l  +  q  +  ...  +q  u.  The  u’ -1  -dimensional  flat  and  a 
point  not  in  this  u' -l -dimensional  flat  specify  a  u' -dimensional  flat  uniquely. 

If  two  u'  -dimensional  flats  have  a  point  outside  this  u'-l  dimensional  flat  in 
common,  they  must  be  identical. 
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In  general,  the  u'  -1-dimensional  flat  consists  of  points  of  the  form 
a.^a  ^  +  &2a 2  +  au' -1  a u’ -1  +  ?  w^ere  7  is  a  fixed  element  in 
GF(qm).  The  u' -dimensional  flat  containing  this  u'-l-dimensional  flat  con¬ 
sists  of  points  a1  a  j  +  a2  o  2  +  •  •  •  +  au*  a  u>  +  au*  a  u*  +  7  •  Adding 
y  to  the  points  in  EG(m,  q)  can  be  considered  as  a  permutation  of  points  in 
EG(m,q).  The  argument  in  the  preceding  paragraph  is  still  true  for  this 
u'-l -dimensional  flat.  This  proves  the  theorem. 

The  number  1  +  q  +■  . . .  +  qm  u  is  no  less  than  1  +  q  +  . . .  +  qm  u 
for  any  u'  (1  <  u'  <  u).  We  can  always  pick  d-1  (d  »  2  +  q  +  . . .  +  qm  U) 
number  of  u' -dimensional  flats  orthogonal  on  a  particular  u’ -1 -dimensional 
flat  to  determine  a  parity  check  sum  corresponding  to  this  u’-l -dimensional 
flat.  The  determination  will  be  correct  provided  the  number  of  errors 
occurred  is  no  more  than  £(d-l)  /  2j.  The  decoder  consists  of  u  levels  of 
majority  logic.  In  the  u-th  level,  we  need  a  majority  gate  to  determine  the 
point  (0 -dimensional  flat)  corresponding  the  first  digit  position  of  a  code 
word.  The  input  to  this  majority  gate  is  1  +  q  +• . . .  +  qm  U  1  -dimensional 
flat  parity  check  sums  orthogonal  to  this  point.  We  use  1  +  q  +  . . . .  +  qm  U 
majority  gates  in  the  (u  -  l)-th  level.  The  decoder  is  tree -like.  In  the 
j  -th  level  (1  <  j  <u),  we  use  (1  +  q  +  . . .  +  qm  u)  u  ^  majority  gates  to 
determine  the  same  number  of  (u-j) -dimensional  flats.  The  total  number 
of  majority  gates  is 

u 

I«  >  (1  +  q  + - +  q  )j 

J-l 

The  choice  of  (u-j) -dimensional  flat  parity  check  sums  in  j-th  level 
is  not  unique.  The  construction  of  the  decoder  is  not  unique.  Furthermore, 
the  u’(l  <  u'  <  u)  dimensional  flat  used  in  the  majority  voting  may  not  be 


* 
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linearly  independent.  We  may  not  need  I  number  of  majority  gates  if  this 
is  the  case.  The  tree -like  decoder  is  not  necessarily  the  best  one.  Simplifi¬ 
cation  in  circuitry  is  possible  by  detail  evaluation  of  the  dependency  of  the 
parity  checks  required. 

We  now  give  an  example  of  the  binary  tl5, 6)  Euclidean  Geometry  code 
to  illustrate  the  method  of  obtaining  parity  checks  required  for  threshold  de  - 
coding. 

2 

Example:  Take  q  =  2  ,  m  =  2,  and  u  ■  1.  Let  a  be  a  root  of  primitive 

4  4  4 

polynomial  x  +  xt  1  over  GF(2  ).  or  is  a  primitive  element  of  GF(2  ). 

4 

All  the  elements  in  GF(2  )  are  linear  combinations  over  GF(2)  of 
2  3 

1,  a  ,  a  ,  a  as  follows: 

ro  .  2  3  4  5  6  7  8  9  10  11  12  13  14] 

OlaraaQ'O'QraQrQror  a  a  a  aj 


r  3  2  n 

- lar  a  a  lj 


0000100110101111 

0001001101011110 

0010011010111100 

0100010011010111 


(2. 10) 


5  2 

Let  P  ■  0  ,  then  ft  is  a  primitive  element  of  GF(2  ).  We  can 

consider  GF(2*)  as  an  extension  field  of  GF(2^).  a  is  a  primitive  element 

4  4  2 

of  GF(2  ).  All  elements  in  GF(2  )  are  linear  combinations  over  GF(2  ) 

2 

of  1,  a  .  All  elements  in  GF(2  )  can  be  written  as  linear  combinations  of 
1,  0.  From  equation  (2. 10).  all  elements  in  GF(2*)  can  be  written  as 
linear  combinations  of  1,  p,  a  ,  a  p  over  GF(2)  as  follows: 
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i^i] 


2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

0 

1 

a 

a 

a 

a 

a 

a 

a 

a 

a 

a 

a 

Of 

0 

0 

0 

0 

1 

0 

0 

1 

1 

0 

1 

0 

1 

1 

1 

1 

0 

0 

1 

1 

1 

1 

0 

0 

0 

1 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

1 

1 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

1 

1 

0 

1 

0 

1 

1 

1 

(2.11) 


Since  IS  satisfies  the  primitive  polynomial  x  +  x  +  1  over  GF(2), 


0  «0  +  i 


(2.  12) 


Equation  (2. 11)  can  be  rewritten  as 

n  .  2  3  4  5  6  7  8  9  10  11  12  13  14l 

0  1  aaaaaaaaaa  a  a  a  a  J 


■[•  ‘] 


o  o  l  i  02  i  o  3  3  1  3  0  32  P2  3  32 
o  l  o  ^  ^  l  ^  o  32  02  3  32  o  l  l  32 


(2.13) 


It  is  easy  to  see  from  equation  (2, 13)  that  a  is  a  root  of  a  primitive 

2  2 
polynomial  x  +  x  +  /3  over  GF(2  ). 

The  generator  polynomial  g  (x)  of  the  Euclidean  geometry  code  C 

c 

L 

contains  a  as  roots  for  the  integers  h  such  that 

0  <  w4(h  2j)  <(2  -  1)  (4  *  1)  ;  0  <  j  <  2  -  1 
The  integers  h  satisfy  this  condition  are  0,  1,  2,  4,  8,  3,  6,  9  and 

12. 

The  generator  polynomial  g^<x)  of  the  dual  CQ  of  the  code  C 
contains  a  ^  as  roots  for  the  h's  equal  to  1,  2,  4,  8,  5  and  10.  The  q-ary 
(q*4)  third  order  modified  Reed -Muller  code  has  generator  polynomial  g^(x) 
contains  a  ^  as  roots  for  the  h's  equal  to  1,  2, 4, 8  and  5.  The  code  Clover 
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I 

r 

i 

i 


GF(2  )  with  g2(x)  as  its  generator  polynomial  is  a  proper  subcode  of  the 
q-ary  third  order  modified  Reed-Muller  code. 

From  matrix  (2. 13),  the  1 -dimensional  flats  over  GF(22)  of  EG(2,22) 
passing  through  the  point  a  0  **  1  are  as  follows. 


4 


1 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

(2. 14) 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

i 

4 

i 

A 

1 

4 


I 

4 

1 

J 

i 

I 

« 


The  i-th  row  (1  <  i  <  5)  corresponding  to  a  1-dimensional  flat  con¬ 
sists  of  a  a  *  1  +  1  where  a  runs  over  GF(22). 

By  deleting  the  first  column  of  matrix  (2. 14),  we  have  a  set  of  5  parity 
checks  orthogonal  to  the  point  a  °.  These  are  parity  checks  for  the  binary 
(15, 6)  code. 

2.4  Modified  Euclidean  Geometry  Codes 

In  this  section,  we  define  a  modified  Euclidean  geometry  code  which 

has  one  more  information  digit  than  the  corresponding  Euclidean  geometry 

code  and  has  a  lower  bound  for  minimum  distance  less  than  that  of  the  latter 

code  by  one  and  can  be  u-step  decoded  up  to  this  bound. 

Let  gp(x)  be  the  generator  polynomial  of  the  Euclidean  geometry  code 

with  parameters  m,  u,  q.  We  define  the  corresponding  modified  Euclidean 

geometry  code  by  the  code  generated  by  g  (x)  /(x-1).  Obviously,  the  lower 

c 

bound  for  the  minimum  distance  of  this  code  is  1  +  q  +  . .. .  +  qm  u.  q-ary 
(m-u)  (q-i)-th  order  Reed-Muller  code  contains  all  the  vectors  associated 
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with  u-dimensional  flats.  The  generator  matrix  of  G  of  this  Reed-Muller 
code  would  have  the  property  that  the  vector  with  all  1  entries  is  the  only 
vector  with  nonzero  entry  in  the  first  digit.  If  we  delete  the  all  1  vector 
in  G,  the  row  space  of  the  new  matrix  contains  all  the  vectors  in  the  row 
space  of  whose  first  digit  is  equal  to  zero.  Let  v  be  a  vector  obtained 
by  deleting  the  first  digit  of  a  vector  corresponding  to  an  u-dimensional  flat 
which  does  not  pass  through  the  first  point.  From  this  argument,  a  code 
with  generator  polynomial  (x-1)  g^(x)  contains  all  the  vectors  v  '  where 
gj(x)  is  the  generator  polynomial  of  q-ary  (m-u)  (q-l)-th  order  modified 
Reed-Muller  code.  Thus  the  modified  Euclidean  geometry  code  satisfies 
all  u-dimensional  flat  parity  checks  which  do  not  pass  through  the  deleted 
point.  The  total  number  of  u'  -dimensional  flats  (1  <  u’  <  u)  containing 
a  particular  (u'-l)-dimen3ional  flat  which  does  not  pass  through  the  deleted 
point  is  1  +  q  +  . . . .  +  qm  u  .  Only  one  of  these  u’  -dimensional  flat  con¬ 
tains  the  deleted  point.  We  can  decode  this  modified  Euclidean  geometry 
code  to  the  distance  1  +  q  +  . . . .  +  qm  U  by  u-step  orthogonalization  pro¬ 
cedure. 

We  can  obtain  some  binary  modified  Euclidean  geometry  codes  easily 
from  Table  2. 1. 
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III.  CODES  DERIVED  FROM  PROJECTIVE  GEOMETRIES 
3. 1  Introduction 

A  class  of  cyclic  codes  suitable  for  threshold  decoding  has  been 
developed  by  Rudolph  [l6]by  using  the  properties  of  projective  geometries. 

He  first  found  a  majority  decoding  algorithm  which  does  not  require  the 
parity  checks  used  in  majority  voting  to  be  orthogonal.  The  guaranteed 
error  correction  of  a  code  can  be  determined  easily  as  the  parity  check 
matrix  is  an  incidence  matrix  of  a  balanced  incomplete  block  design  which 
is  also  cyclic.  However,  it  is  often  necessary  to  compute  the  rank  of  the 
parity  check  matrix  individually.  Weldon  [l8]  defined  and  developed  the  non¬ 
primitive  Reed -Muller  codes  in  terms  of  the  roots  of  generator  polynomials. 
An  important  subclass  of  non -primitive  Reed-Muller  codes  are  subcodes  of 
Rudolph's  projective  geometry  codes.  Following  Weldon's  approach,  the 
generator  polynomials  of  these  non -primitive  Reed-Muller  codes  can  be  found 
easily  and  a  more  powerful  decoding  scheme  is  also  applicable.  However, 
the  Rudolph  version  of  these  codes  generally  possess  a  larger  number  of 
information  digits. 

In  this  chapter,  we  describe  codes  derived  from  projective  geometries 
in  terms  of  the  roots  of  the  generator  polynomials.  These  codes  are  better 
than  Weldon's  non-primitive  Reed-Muller  codes  becaase  they  have  more 
information  digits  in  general.  These  codes  are  generally  subcodes  of 
Rudolph’s  codes.  So  far  we  have  not  yet  found  any  of  the  cases  that  these 
codes  are  not  identical  to  Rudoiph’s  projective  geometry  codes.  A  better 
code  which  contains  the  new  code  as  a  subcode  and  which  was  all  required 
parity  checks  is  given.  In  some  special  case,  this  code  is  identical  to  the 
Rudolph's  projective  geometry  code.  The  description  of  the  generator  poly¬ 
nomial  of  this  code  is  somewhat  less  easy  than  the  previous  one. 
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In  an  attempt  to  find  the  polynomial  version  of  the  Rudolph's  code, 
we  have  succeeded,  independently  of  Weldon's  work,  in  constructing  the 
generator  polynomials  of  a  class  of  cyclic  codes.  It  is  shown  in  section 
3.3  that  these  codes  include  Weldon's  codes  as  subcodes  in  general  and 
they  possess  a  larger  number  of  information  digits  in  a  number  of  cases. 

3.2  Rudolph's  Projective  Geometry  Codes 

First,  let  us  introduce  Rudolph's  majority  decoding  algorithm.  Let 

A  =[a.  j  ,  i  =  0,  1,  - -  b-1;  denote  the  parity  check  matrix  of  a  cyclic 

code  over  GF(p).  Suppose  the  leftmost  column  of  A  contains  r  nonzero 
elements,  namely,  a.  _,  k»l,2 .  r.  Consider  A  ®  a.  .  , 

1k°  0  V 

k-1,2 . r  a  submatrix  of  A.  A  received  sequence  B  =  (b  ,  b^,  ... 

byl)  is  a  vector  sum  of  a  transmitted  code  word  C  =  (cq,  c^  ...,  c  j) 

and  an  error  vector  E  «  (eo>  ....  ey_j).  To  decode  received  digit 

bo>  we  first  muitiply  the  matrix  Aq  by  the  transpose  of  the  received  sequence 

T 

B  and  set  the  product  AqB  equal  to  zero.  The  resulting  equations  are 


V  A 

Y  a.  b 
<->  »iJ  1 


k-1,2,...,  r 


(3. 1) 


Treating  bQ  as  an  unknown  and  solving 


3  «  -  a1  y  a 
o  i.o  <£_,  1, 


>  a  b. 

*k°  v  j 

j-i 


k-1,2 . r 


(3.2) 


Denote  the  r  '’estimates''  of  the  first  received  digit  by  b '  \ 

k  »  1,2 . r.  One  additional  estimator  is  the  identity  b  ^  •  b  .  Now 

1  o  o 

set  the  decoded  symbol  equal  to  that  value  of  GF(p)  assumed  by  the 

(k) 

largest  fraction  of  the  r  +  1  estimates  •.  For  a  cyclic  code  this  scheme 


for  decoding  tit:  first  digit  also  decodes  the  other  v  -1  digits. 
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In  a  balanced  Incomplete  block  design,  we  have  v  objects  arranged 
in  b  blocks.  Each  block  contains  distinct  objects.  Each  object  occurs 
r  times  and  each  pair  of  objects  occurs  together  in  A  times.  Block  design 
is  conveniently  represented  by  a  b  by  v  incidence  matrix  S^s^J  where 
s.  .  =  1  if  i-th  block  contains  j-th  element,  s.  =  0  otherwise.  Elemently 
conditions  for  the  existence  of  a  (v,  kj.r.b,  A)  balanced  incomplete  block 
design  are  as  follows. 

1.  vr  *  b  kj 

2.  A(v-l)  =  r(k1-l)  (3.3) 

If  the  parity  check  matrix  A  is  the  incidence  matrix  S,  then  the  r  by  v 
submatrix  will  have  r  l's  in  its  leftmost  column  and  A  l’s  in  all  other  columns. 
This  leads  to  a  set  of  r  +  1  estimators  (including  the  identity  with  each  b. 
appearing  in  no  more  than  A  equations.  The  decoding  algorithm  is  capable 
of  correcting  any  combination  of  e  or  fewer  errors  where  e  =[r/2Aj. 

The  brackets  denote  "integer  part  of.  " 

A  balanced  incomplete  block  design  is  called  cyclic  if  every  cyclic 
permutation  of  a  row  of  the  incidence  matrix  A  is  also  a  row  of  A.  One  well- 
known  class  of  cyclic  designs  is  associated  with  projective  geometries. 

Denote  by  PG(m^,q)  the  projective  geometry  of  dimension  over 
GF(q).  For  each  u(l  <  u  <  rc^),  one  can  associate  a  cyclic  incidence 
matrix  A  such  that  the  columns  correspond  to  the  points  and  the  rows  corre¬ 
spond  to  all  possible  u -spaces  of  PCHm^q).  The  error -correcting  ability 
is 


e 


l2(qU  -1)J 


(3.4) 


The  number  of  check  digits  however  is  not  known  explicitly.  A  computational 
procedure  is  of  course  possible  by  determining  the  rank  of  A. 
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3.3  A  Method  fen:  Determining  the  Generator  Polynomials  of  Projective 
(geometry  Codes 

Rudolph  describes  his  projective  geometry  codes  through  the  parity 
check  matrices.  In  this  section  a  procedure  is  described  for  finding  the 
generator  polynomials  of  a  new  class  of  cyclic  codes.  It  is  shown  that  the 
code  polynomials  of  the  codes  generated  do  satisfy  Rudolph’s  parity  check 
equations.  Hence  the  new  codes  are  subcodes  of  the  codes  specified  by 
Rudolph. 

s  m 

Let  q  ®  p  .  Let  a  be  a  primitive  element  of  GF(q  ).  Let  n  be 

(qm-l)  /  (q-1).  As  the  elements  a  \l  <  i  <  m^,  =  m-1)  are  linearly 

independent  over  GF(q),  we  may  write 

v..  a*  0  <  j  <  qm-2  (3.5) 


where  v.  ^  is  in  GF(q). 


Arrange  the  coefficients  in  matrix  form  and  define 


-  " 

u 

ml 

i 

< 

3 

o 

v  ,  .  . 

m^l 

vm1(n-l) 

G  - 

• 

U1 

a 

V10 

•  •  • 

V11 

V1 (n-1) 

v00 

• 

o 

> 

V0(n-1) 

(3.6) 


We  can  associate  each  colunw.  .corresponding  to  a1)  of  this  matrix 
to  a  point  of  PGfm^q)  [l]  • 

An  u-space  of  FGim^.q)  is  defined  to  be  the  totality  of  the  points 
linearly  dependent  upon  a  set  of  u  +  1  linearly  independent  points.  A  vector 
associated  with  an  u-space  is  a  vector  such  that  its  component  is  equal  to  1  if 
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the  position  of  this  component  corresponds  to  a  point  contained  in  this  u -space 

and  the  component  is  equal  to  0  otherwise. 

We  first  show  that  the  vectors  associated  with  u -spaces  can  be 

expressed  as  Linear  combinations  of  vectors  related  to  the  product  of  the 

powers  of  the  vectors  u  ,  u, ,  . . . ,  u  .  Let 

ol  m. 


uiV<viO’  vil . vi<n-l)  >  <vj0’  vjl . vj(n-l)> 


<viOvjO'  vilvjl . vi(n-l)vj(n-l)>'  °i  ‘'if  ml 


(3.7) 


where  v. ,  v. .  is  a  product  of  v. .  and  v..  in  GF(q).  For  a  positive 

1KJK  £  IX  JK 

integer  f .  ,  define  u. 1  to  he  u.  multiplied  by  itself  i.  times.  Define 


o  o 
UI  *u0  U1 


(1,1,....  1). 


(3.8) 


Let  L  be  a  matrix 
u 


l  l.  m. 

o  1  1 

-  *  urt  u ,  . . . ,  u 
u  0  1  mi 


(3.9) 


*o  ^  rnx 

whose  rows  are  the  vectors  un  u.  ....  u_  with  l,  l,,  ....  I 

0  1  nrij  o  1 

satisfying  the  condition 


1 

i.  -c(q-l);  0<c<(m1-u),  0  <  i.  <q-l 


(3. 10) 


The  u -space  which  contains  a  °,  a  .....  o  u,  corresponds  to  the 


vector 


1  JLl  ,  q-1  . 

-  TT  (u  -  u, ) 

m.  -u  .  H  v  i  I ' 

.  1  i-u  + 1 


(3. 11) 
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This  vector  is  in  the  row  space  of  L  .  Consider  any  u -space  con- 
dQ  d  d  u 

taining  a  0  ,  a  . . . ,  a  u  linearly  independent  points  over  GF(q),  then 
there  exists  a  nonsingular  matrix  M  over  GF(q)  such  that 


r  °  i  u*i  wr  do  di  aui  , 

la  ,  a  ,  a  J=  Mj^a  ,  a  ,  . . . ,  a  J  (3. 1 

Let  be  . x.,  xlT  »  M  [u  ,  ....  uv  u  1T.  The  vector 

L  m^  1  oj  *-  m^  l  oJ 

ml 

-AkttTT  (x?  *  -  Uj  Correspond.’  to  the  u -space 
(-1)  1  i«u+l 


(3. 12) 


containing  d  ,  dj . d  .  One  can  easily  verify  that  this  vector  is  a 

°  U  lo  *ml 

lineary  combination  of  the  vectors  Uq  u 1 . . . .  um  satisfying  the 

condition  (3. 10).  Thus  we  have  shown  the  following. 

Lemma  3. 1  Any  vector  corresponding  to  an  u -space  is  in  the  row 

space  of  Ly  over  GF(q). 


,  I-,  h  ,  h  ji-1 

'u  "I1  a  - > 


(3. 13) 


where  h  is  an  integer  less  than  q  -1  and  is  a  multiple  of  q-1  and  the  weight 
of  h  over  base  q  is  no  more  than  ( m ^  -  u)  (q-1). 

We  want  to  show  that  the  row  space  of  G^  over  GF(q)  is  identical 
to  the  row  space  of  L^  over  GF(q). 


•z 


h.  q  ;  0  <  h.  <  q-1 

i  ^  -  l  -  ^ 


(3. 14) 


1  A  A 

h-  ^  h.  +  hi  (q*-l)  -  w  (h)  +  fv(ql-l)  (3.15) 


q  -1  is  divisible  by  q-1.  Thus  h  is  divisible  by  q-1  if  and  only  if  w^(h)  is 
divisible  by  q-1. 
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Any  integer  h  in  the  matrix  Gu  satisfies 

Wq(h)  *  c(q-l)  ;  0  <  c  <  (m^-u)  (3.16) 

We  first  show  that  any  row  vector  of  Gu  is  a  linear  combination  ctf 
the  rows  of  Ly.  Consider  a  typical  row  of  matrix  (3. 13), 

[l  ah  - (a1*)11'1]  (3.17) 


The  matrix  consists  of  m  rows.  Let 
™1 

The  typical  column  ( a  **)  j  is  actually  as  the  following. 


(3. 18) 


mjj 


CU 


oj 


(3. 19) 


We  need  only  to  show  that  c.  .  (m.  <  i  <  0  )  is  a  linear  combination 

l0  1  lm  ~ 

over  GF(q)  of  the  terms  v  v,  .  ...  v  *  with  l  ,  l.,  .....  t 

oj  lj  m^j  o  1  m. 

satisfying  condition  (3.  10)  and  this  bnear  combination  is  independent  of  j. 


From  equations  (3. 5)  and  (3. 14),  we  have 

ml 

(orh)  j  -  (aJ)  h  -f  Vi j  ) 

i-0  ' 


htq 


(3.  20) 
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By  expanding  equation  (3.20) 


v-3(  | 

-tHE  », 

\0<  ly  ^2*  *  *  *  9 


v.  a 

V 


(ij+i^ . .  •  +  ih )  q 


(3.21) 


In  the  t-th  factor  of  equation  (3. 21),  each  term  in  the  summation  has 
as  coefficient  v.  .  v . .  v.  .  which  is  a  product  of  h  of  the  factors 

V  V  \J  - 

v0J,  vm  .  with  repetitions  permitted.  By  expanding  equation 


(3.21) 


<-h>*  -  “f<h) 

f 

Where  f(h)  depends  only  on  h  but  not  on  j,  b, .  has  the  form 

r  ] 

1'0  V  mr 

bfj“v0j  Vlj  vm.j 


(3.22) 


(3.23) 


£  ‘i  ■  £  ht  ■  Wq(h) 


(3.24) 


<  i.  <  q-1  and  /.  ®  t'  mod  q-1 

-  1  -  n  li 


l  t.  m. 

.  o  1  1 

b.  »  vn .  v,  - v 

fj  Oj  lj  mjj 

with  lQ,  ...,  satisfying  condition  (3. 10). 


(3.25) 
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can  be  expressed  as  a  linear  combination  over  GF(q)  of  1, 
Equation  (3. 22)  can  be  rewritten  as 


S  cu 


(3.  26) 


where  c.  ^  is  a  linear  combination  over  GF(q)  of  b^ ..  Thus  we  have  shown 

that  the  row  space  of  Gu  over  GF(q)  is  a  subspace  of  the  row  space  of 

over  GF(q).  The  dimension  of  the  row  space  of  Gu  is  equal  to  the  number 

of  h  satisfying  condition  (3. 16).  The  dimension  of  the  row  space  of  Lu  over 

GF(q)  is  no  more  than  the  number  of  rows  of  Lu-  Let  the  integer  h  be 

corresponding  to  (l,  I  )  if  h.  «  l.  for  0  <  i  <  m,  .  The 

oi  nij  ii  —  —  1 

number  of  h  satisfying  condition  (3.  io)  and  the  number  of  (Iq,  ly  . . . ,  ) 

satisfying  condition  (3.  10)  are  equal  because  there  exists  one  to  one  corre¬ 
spondence  between  these  h's  and  these  (iQl  ly  ....  I  )'s.  The 
dimension  of  the  row  space  of  Gu  is  no  less  than  the  dimension  of  the  row 
space  of  Ly.  Hence,  we  have 

Lemma  3.  2  The  row  space  of  the  matrix  G^  over  GF(q)  is  equal 
to  the  row  space  of  the  matrix  over  GF(q). 

Theorem  3. 1  Let  C  be  a  code  with  a  ^  as  a  root  of  its  generator  poly- 
nor  d  g(x)  if  and  only  if  h  is  divisible  by  q-1  and  satisfies  the  following 
condition. 


0  <  w  (hpj)  <(m  -  u)(q-l)  ;  0  <  j  <  s-1 


(3.27) 


w^hp^  )  is  a  digit  sum  of  the  q-ary  representation  of  hp^.  The  null 
space  of  C  contains  all  u -spaces  of  PG(m^,q). 

Proof:  From  lemma  3. 1  and  lemma  3.  2,  any  vector  associated  with  an  u- 
space  is  in  a  code  over  GF(q)  whose  generator  matrix  is  equal  to  G^.  The 
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parity  check  matrix  in  this  code  is 


H  = 
u 


X  a 


h' 


<*v-' 


(3.28) 


where  h'  is  an  integer  less  than  qm-l  and  satisfying  the  condition 


q-1 


h' 


0<  w^h’)  <  (u+lXq-1) 


(3.29) 


h* 

Let  v(x)  be  a  polynomial  associated  with  an  u -space,  a  1  are  roots 

of  v(x)  for  h'  satisfying  condition  (3.29).  v(x)  has  its  coefficients  over  GF(p). 

H*  h*D 

a  is  a  root  erf  v(x)  implies  a  n  "  are  roots  of  v(x). 

A  code  whose  generator  matrix  is 


1  a 


(a 


(3.30) 


with  h'  satisfying  condition  (3.29)  satisfies  the  parity  checks  associated  with 
all  u -spaces  of  FG(m1>q). 

Thus  the  generator  polynomial  of  this  code  has  a  h  as  its  roots  for  h 
satisfying  condition  (3.27).  This  theorem  is  proved. 

BCH  bound  for  this  code  can  be  seen  as  follows.  Let  h  *  t(q-l)  and 
ml 

»  j  m  -u  m  -u-1 

h  *  y  h.q  where  9*1^  h.  >0.  For  0  <  t  <q  +  q  + 

1-0 

-  +  q.  h.  equal  to  0  for  nij  >  i  >  (m^  -u  +  1)  and  not  all  other  h,'s  equal 

to  q-1,  hence  w^  (t  (q-l))<  (m^-u  +  1)  (q-1).  One  can  easily  verify  that 

w  (t  (q-1)  p^)  <  (mj-u  +1)  (q-1)  for  all  these  t' s.  Thus  the  cock:  C  defined 
^  m^-u  m^-u-1 

in  theorem  3.  1  contains  q  +  q  +....+  1  consecutive  roots. 

Weldon’s  decoding  algorithm  for  non-primitive  Re?d-Muller  codes,  i.e.  u-step 

orthogonal! zation  procedure  (_18]  and  Rudolph's  one  step  decoding  algorithm  in 

section  3.  2  utilize  the  fact  that  the  null  space  of  the<r  codes  contain  all  u -spaces 
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of  PG(m1>q).  Their  algorithms  can  be  applied  to  this  code*  Some  possible 
improvement  of  Rudolph's  decoding  algorithm  with  be  presented  in  the  next 
section.  The  guaranteed  decodable  distance  by  u-step  algorithm  is 

m.  -u 

d  =  2+q  + . +  q  (3.31) 

which  is  identical  to  the  BCH  bound  of  the  code  C  in  theorem  3. 1.  The 
guaranteed  decodable  distance  by  one  -step  algorithm  is 

d1=[(qmi-  1)  /  (qU-l)J  +1  (3.32) 

d  is  greater  than  dj  in  general,  but  the  decoder  for  one -step  decoding 
algorithm  may  be  simpler  than  the  decoder  for  u-step  decoding  algorithm. 

In  Table  3. 1,  we  list  some  binary  codes  from  theorem  3.  1.  n,  k  are 
the  code  length,  the  number  of  information  digits,  d  and  d^^  are  defined  in 
equations  (3.  31)  and  (3.  32)  respectively.  The  number  of  information  digits 
of  C  is  identical  to  the  number  of  information  digits  of  Rudolph's  projective 
geometry  code  listed  in  reference  [l6]  .  Thus  theorem  3.  1  establishes  the 
generator  polynomials  for  all  Rudolph's  projective  geometry  codes  listed  in 
Table  3. 1.  We  will  see  later  that  some  of  the  codes  in  Table  3.  1  have  more 
information  digits  than  those  of  Weldon’s  non-primitive  Reed-  Muller  codes. 


Table  3. 1  Binary  Cyclic  Codes  (n,  k,  d,  d^)  Associated  with  FG(m^,  2  ) 


u  =  m,-2 

u  =  irij-3 

u  =  ~4 

(15,4,8,8) 

(31, 15,8,6) 
(63,41,8,5) 

(31,5, 16, 16) 
(63,21,16,11) 

(63,6,32,32) 

(85,24,22,22) 
(341,195,22,  18) 
(1365,1063,22,17) 

(341,45,86,86) 

(1365,483,86,69) 

(1365,78,342,342) 

(585, 184,74,74) 
(4681,3105,74,66) 

(1681,590,586,586) 

(4369, 1568,274,274) 

| 

| 

- - 

We  next  show  that  this  code  contains  Weldon’s  non -primitive  Reed- 
Muller  code  of  corresponding  parameters  as  a  subcode. 

Let  h  be  an  integer  less  than  qm-l.  Let  q-ary  representation  of 


i 

2  hi«'  •  °5  h,  i  'I-1 


(3.33) 


Let  q  *  p  .  Let  p-ary  representation  of  h.  be 


i.  =  ^  h.  .  p*  ;  0  <  l  <  nij  ,  0  <  h.  <p-l  (3.34) 
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)  £-j  ij  ’ 


0  <  j  <  s-1 


(3.35) 


The  weight  of  h  over  base  p  is  defined  as  tne  "digit"  sum  of  the  p- 


presentation  of  h,  that  is 


1  s-1 


(3.36) 


i=0  j=0 


The  condition  for  h  such  that  a  is  a  root  of  the  generator  poly  - 
nomial  of  the  projective  geometry  code  specified  in  theorem  3. 1  is  equivalent 
to  the  following  condition 


wq(h)  =‘Wo  +  pW1  + . +  p3'1  Wgl  =  (ps-l)  kQ 

w  (hp)  =  pWo  +  p2W1+ . +ws-i  =(PS‘1)k1 


(3.37) 


•  .  * 

w  (hpS'1)  =  pS'1Wo  +  W1+ . +pS’  Ws.1=(pS-l)  k8ml 


where  0  <  L  <(11^  -  u) 

From  equation  (3.37),  we  have 


(l  +  p+  ....  +pS'1)(  £  W.)-(pS-l)<  ^  k.  ) 

j*0  j*0 

s-1  s-1 

^  Wj-(p-l)  k.  <  s(m1-u)  (p-1) 
j«0  j*0 


(3.38) 


Only  binary'  case  is  treated  explicitly  in  Weldon's  paper  [l8]  .  Let 

g(x)  be  the  generator  polynomial  of  the  non -primitive  Reed -Muller  cofie  of  the 

s  h 

same  parameters  q  *  2  ,  and  u  as  in  theorem  3.  1.  a  is  a  root  of 
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m  s 

g(x)  if  and  only  if  h  is  an  integer  less  than  2  -1  such  that  h  is  divisible  by 

2s -1  and  the  weight  of  h  to  the  base  2  is  no  more  than  s(m1-  u). 

From  equation  (3. 38),  we  have  shown  that  the  non -primitive  Reed- 
Muller  code  is  a  subcode  of  the  corresponding  projective  geometry  code 
defined  in  theorem  3. 1. 

In  next,  we  show  that  the  special  case  u  **  nij  -  1,  Weldon's  code  is 
identical  to  the  code  defined  in  theorem  3.  i.  For  p  being  any  prime  and 
u  =  m^  -  1,  Weldon's  non -primitive  Reed-Muller  code  contains  roots  ah  for 
h  is  divisible  by  q  -  1  and 


wp(h)  «  W.  <s(p-l) 
j=0 


(3.39) 


For  nonzero  h  which  is  a  multiple  of  pS-l,  it  is  known  [l8j  that  Wp(h) 
is  no  less  than  s(p-l).  From  equation  (3.  39), 


wp(h)  =  Wj  3  s(p_1> 


(3.40) 


Since 


h*  S  hiql  =  S  hi +  E  h*(q1'’ 


(3.41) 


that  h  is  a  nonzero  integer  divisible  by  q  -1  implies  that  ^  h.  is  also  a 


nonzero  integer  divisible  by  q-1.  Thus 


Wp(£  hi  )>  s(p-l) 


(3.42) 
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But 


m. 


m. 


w 


3<  2  hi}  -  E  Wp(hi>  =wp<h)“S(P'1) 


i=0 


i=0 


Equations  (3. 42)  and  (3. 43)  imply  that 


m„ 


m. 


v  Z  hi  >  *  Z  w 


1=0 


1=0 


(3. 43) 


(3.44) 


This  implies  that 


m. 


Wj  =  Z.  hij  -P_1  *  0  5  i  t  S_1 

i=0 


(3. 45) 


From  equations  (3.  40)  and  (3. 45' 


Wj  -p-l  ; 


0  <  j  <  s-l 


(3. 46) 


For  u  equals  to  n^  -  1,  Weldon's  non-primitive  Reed-Muller  code 
contains  root  a  h  for  nonzero  h  satisfying  condition  (3.  46)  and  for  h  being 
equal  to  zero.  These  h  satisfy  the  condition  (3.  37).  Weldon's  code  in¬ 
cludes  the  code  defined  in  theorem  3.  1  as  a  subcode  for  u  *  rOj  -  1  case. 

But  the  former  code  is  a  subcode  of  the  latter  code  in  general.  Two  codes 
are  identical  for  u  =  m^  -  1  case. 

The  number  of  ways  to  obtain  p  -  1  as  ordered  sum  of  m  nonnegative 
integers  is  (P  ^  *),  hence  the  total  number  of  h  satisfying  equation  (3.46) 

is  (P^  .  The  number  of  check  digits  for  the  new  code  is 


r  ,-i  +  (p+m'2)3  -1+  fp  +  mr1)! 
mj-1  \  m-i  J  \  nij  / 


(3.47) 


This  is  an  upper  bound  for  the  number  of  check  digits  for  Rudolph’s 
projective  geometry  code  for  u  being  equal  to  m^  -  1. 


For  social  case  =  2,  the  number  of  check  digits  is 


rm,-»  =1+  <P2+1)S 


(3.48) 


In  reference^],  Graham  and  MacWilliams  have  shown  that  the  number 
of  check  digits  for  any  difference -set  cyclic  code  which  is  identical  to  Rudolph  s 

projective  geometry  code  for  nij  equal  to  two  and  u  equal  to  one  is  equal  to 
rm  in  equation  (3.48).  In  this  case,  the  noo-primitive  Reed-Muller  code 
and  the  code  specified  in  theorem  3. 1  are  identical  to  Rudolph's  projective 
geometry  code. 

The  non -primitive  Reed-Muller  code  does  not  equal  to  the  code  defined 

in  theorem  3. 1  in  general.  For  p  equal  to  two,  the  non-primitive  Reed- 

h  tns 

Muller  code  contains  roots  a  n  for  h  loss  than  2  -1  and  satisfying 


O  X 

2  v(2S-.,  ; 


v  is  an  integer 


s  -1 

X  Wj  5  s*ml  '  u> 

j-o 


(3.49) 


The  condition  u>r  c  v  to  be  the  roots  of  projective  geometry  code 
specified  in  theorem  3. 1  is 


J  * 

5>t+i  2,-k,(2‘-l) 


;  0<k^<(m^-u),  0<  j%<s-l  (3.50) 


where  W.  is  equal  to  W_ .  .  for  some  z  such  that  t  +  j-  zs  is  a  non- 
t+j  t+j  -  zs  J 

negative  integer  less  than  s. 

2 

For  rtij  >  5,  q  «  2  and  -0*3;  WQ  -  0  and  «  6  is  a  solution 
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to  equation  (3. 49)  hit  not  a  solution  to  equation  (3. 50).  In  these 
cases,  the  non -primitive  Reed-Muller  codes  are  proper  subcodes  of  pro¬ 
jective  geometry  codes.  It  is  easy  to  verify  that  if  a  non -primitive  Reed- 
Muller  code  is  a  proper  subcode  of  the  projective  geometry  code  for  =  m', 
q  =  2  ,  m^-u=  i  then  the  former  is  also  a  proper  subcode  of  the  latter 

for  mj  =  m'  +  i,  q  =  2®  m^  -  u  =  &  +  i  for  any  positive  integer  i. 

In  Table  3.2,  we  list  the  paramet  rs  s,  l,  and  of  which  the  non- 
primitive  Reed-Muller  codes  are  proper  subcodes  of  projective  geometry 
codes  specified  in  theorem  3.1.  In  the  remark  column,  we  give  the  reason 
for  the  former  codes  being  proper  subcodes,  that  is,  the  W.'s  which  satisfy 
the  condition  (3. 49)  but  not  (3.  50).  Some  numerical  examples  are  listed 
in  table  3. 3. 


Table  3. 2  Cases  of  Which  Binary  Non-primitive  Reed-Muller  Codes  are 


Proper  Subcodes  of  Projective  Geometry  Codes 


s 

l  =  m^-  u 

ml 

Remark 

2 

3  +  i 

>5  +  i 

W  - 

0 

i  ,  W 

3 

2+i 

>4  +  i 

W  * 

0 

1  +  i,  W 

3 

3  +  i 

>4  +  i 

w  - 

i  ,  W 

Remark  (i  is  any  positive  integer) 


W  -2  +  i,  W.-1  +  :,  W0  *  4  +  i,  W,  »5  +  i 
o  1  x  o 


4 


2  +  i 


>5  +  i 
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Table  3.3  Numerical  examples  of  codes  in  Table  3. 2 


s 

£ 

(n,  kj) 

(n,  kj) 

2 

3 

5 

(1365,  481) 

(1365,  483)* 

2 

3 

6 

(5461,  3143) 

(5461,  3185) 

2 

4 

6 

(5461,  742) 

(5461,  1036) 

2 

3 

7 

(21845,  17532) 

(21845,  17588) 

2 

4 

7 

(21845,  9048) 

(21845,  9096) 

3 

2 

4 

(4681,  3090) 

(4681,  3105)* 

3 

3 

4 

(4681,  575) 

(4681,  590)* 

4 

2 

3 

(4369,  1505) 

(4369,  1568)* 

4 

2 

4 

(69905,  50779) 

(69905,  52079) 

4 

3 

4 

(69905,  4979) 

(69905,  5579) 

kj  is  the  number  of  information  digits  of  non  -primitive  Reed-Muller 
code.  k2  is  that  of  projective  geometry  code. 

A  code  containing  the  code  of  theorem  3. 1  as  a  subcode  and  also 
satisfying  the  parity  checks  associated  with  u  -dimensional  spaces  will  be 
presented.  We  prove  that  in  some  special  cases,  this  code  is  identical  to 
Rudolph's  projective  geometry  code. 

From  equations  (3.37)  and  equation  (3.  46),  the  code  Cm  ^  specified 
in  theorem  3.  1  for  u  equal  to  m^  -  1  has  its  generator  polynomial  containing 
o  h  as  roots  if  and  only  if 


indicates  this  projective  geometry  code  appears  in  Table  3. 1. 
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h  =  0 


(3.51) 


or  h  satisfying  the  condition 

(hp*  )  =  q-1  ;  0  <  j  <  s-1 


w_ 


Let  C  be  a  code  whose  generator  polynomial  consists  of  roots  a 

h<‘> 

which  is  a  product  of  the  roots  a  (  1  <  i  <  m,  -  u)  of  the  generator 

h(i)  -  -  1 

polynomial  of  Cm  _j  .  a  are  not  necessarily  distinct. 


(3.52) 

h 


nij-u 

s 

i=l 


,0) 


(3.  53) 


From  equations  (3. 51),  (3. 52)  and  (3.  53) 


mj-u  m^-u 

Wq^P* )  -  wq  (  ^  h(lV  >5  ^  wq(h^V  “u)  fa"1)  (3-  54) 

i=*l  i*l 

h^  are  multiples  of  q-1,  then  h  is  a  multiple  of  q-1.  From  equation  (3.  54)  , 

wq(hP*)  =  vj(q-l)  ;  0  <  j  <  s-1  ,  0  <  v.<(mru)  (q-1)  (3.55) 

Hence  the  code  Cy  contains  the  code  specified  in  theorem  3. 1  as  a 

subcode.  We  want  to  show  that  Cy  has  all  u -dimensional  flats  as  parity 

checks.  Furth  _  rmure  Cy  is  identical  to  a  Rudolph’s  projective  geometry 

cotfc  when  .  is  equal  to  a  Rudolph’s  projective  geometry  code. 

The  generator  matrix  of  the  dual  code  of  C  ,  is 

m,  -l 

G  .  -  J  1  a  h  -  (a  h  j"'1 

mrl  [ 


(3.  56) 
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for  h  equal  to  zero  or  h  satisfying 


W.  =  p  -  1  ;  0  <  j  <  s-1  (3. 57) 

where  W.  is  defined  in  equation  (3. 35). 

Let  (  iQ,  . '  ^  corresponding  to  h  if 


£iahi  - 


0  <  i  < 


(3.58) 


Then  the  row  space  of  G  ,  can  be  shown  to  be  identical  to  the 

m^-1 

row  space  of 

l 

umi  (3.59) 

ml  J 


r  ii 
0  1 

-  ,  “  u  u, 

m^-1  jo  1 


with  (£q,  2^ . ,  £m  )  corresponding  to  the  h  equal  to  zero  or  h 


satisfying  equation  (3. 57)  as  follows.  m 


m. 


(«V  ■  (a^b  »(  ^  v.  ^a1) 


1  s-1 

i  i" < 

t=  k*0 


ts  +  k 


(3.60) 


m 


m 


1 


.  .  _ 1  s-1  r-,  k  .  ts+k  h„. 

(a  >  -TT  IT  <  ^  vu  a  » 

t=*0  k=0  i*0 


(3.61) 


The  rest  of  the  proof  can  be  accomplished  by  analogy  of  the  proof  of 
lemma  3. 2. 

Let  L^  be  a  matrix  whose  row  vectors  aie  the  vector  product  of 

the  row  vectors  of  L  ,  taking  m,  -  u  at  a  time. 

m^-1  1 

The  parity  check  matrix  of  Cu  can  be  written  of  the  form  in  equation 
(3.56)  for  h  satisfying  equation  (3.  53).  Similar  argument  as  u  equal  to 
m.  -1  case,  the  matrix  L’  is  also  a  parity  check  matrix  of  C  . 

1  u  I™  /  u 
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Since  any  (n^-  1) -space  of  PG(m1>q)  is  in  the  row  space  of  I_m  ^ 
Any  u-space  can  be  considered  as  an  intersection  of  some  m^-u  number  of 
(nij-  1) -spaces,  hence  it  must  be  in  the  row  space  of  L\  In  case  that  the 
vectors  associated  with  (ir^-  l)-space  of  PG(m^,  q)  span  the  row  space  of 
L^,  the  vectors  associated  with  u -spaces  of  PG(nijfq)  als  jan  the  row 
space  of  1/ ■ 

Thus  the  code  has  all  u-spaces  of  PG^m^.q)  as  its  parity  checks. 
Cu  is  equal  to  Rudolf's  projective  geometry  code  provided  Cm  is  equal 
to  Rudolf's  projective  geometry  code. 


3.  4  On  Rudolph's  Decoding  Algorithm  for  Projective  Geometry  Codes 

Rudolph's  Decoding  Algorithm  uses  all  u-spaces  of  PG<m1>q)  for 
majority  voting.  In  this  section,  we  show  by  an  example  that  it  is  possible 
to  choose  a  set  of  u-spaces  which  are  orthogonal  on  a  point  for  majority 
voting  and  achieve  the  same  guaranteed  decodable  distance  by  Rudolph’s 
method  in  some  cases. 

Example.  For  m^  =  4,  q  =  2,  u  =  2,  we  have  binary  (31,  15)  code. 

The  Rudolph  decoding  algorithm  requires  all  105  2 -spaces  to  be  used  for 

majority  voting  and  achieve  a  guaranteed  decodable  distance  6. 

5  2 

Let  o  be  a  root  of  primitive  polynomial  x  +  x  +  1  over  GF(2), 
then  the  matrix  G  in  equation  (3.  6)  becomes 


0000100101 10011 11 10001 101 110101 
OOOlOOlOllOOlllllOOOllOUlOlOlC 
0010010110011111000110111010100 
010000100101 10011 1 1100011011101 
1000010010110011111000110111010 


(3.62) 
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We  can  choose  the  following  five  2  -spaces  orthogonal  to  the  first 

point. 

[1110010000010000001100000000000 
1001 100000100000000001000100010 
1000001100000001000000101001000 
1000000011000000100010000010100 

lOOOOOOOOOOOlllOOlOOOOOlOOOOOOlJ  (3.63) 

Ttie  guaranteed  decodable  distance  by  using  this  set  of  orthogonal 
parity  checks  is  also  6. 

It  is  impossible  to  choose  sufficient  number  of  u -spaces  which  are 
orthogonal  on  a  point  for  majority  voting  and  achieve  the  same  guaranteed 
distance  by  Rudolph's  method  in  general.  For  example,  any  two  ( m ^  -  1)- 
spaces  of  PCKm^q)  intersect  a  (nij  -  2) -space.  We  cannot  get  a  set  of  two 
or  more  parity  checks  orthogonal  on  a  point  for  m^  >  2.  The  guaranteed 
decodable  distance  by  Rudolph's  method  is 


[%->■•[’*  -M-] 

L  q  -1 J  L  q  -1  J 


+  1  a  q  +  1 


Thus  for  projective  geometry  code  associated  with  (m^  -  1)  -  spaces 
of  JPCHm^q)  with  m^  >  2,  we  cannot  obtain  q(q>2)  number  of  (m^  -  1)  spaces 
orthogonal  on  a  pxunt  for  majority  voting,  hence  we  cannot  achieve  the  same 
guaranteed  decodable  distance  q  +  1. 


38 


IV.  INVESTIGATION  OF  THRESHOLD  DECODING  FOR  CYCLIC  CODES 

Since  BCH  codes  are  most  powerful  random  error -correcting  codes, 
we  investigate  whether  all  BCH  codes  can  be  L-step  orthogonalized.  Un¬ 
fortunately,  we  find  that  a  class  of  double  error -correcting  BCH  codes  cannot 
be  L-step  orthogonalized.  On  "he  other  hand,  we  found  that  BCH  codes  with 
length  qm-l  as  well  as  Euclidean  geometry  codes  can  be  one  step  decoded 
by  parity  checks  which  are  not  necessary  orthogonal.  We  cannot  decode  these 
codes  to  their  minimum  distances  in  general.  These  codes  decoded  by  this 
method  is  comparable  to  projective  geometry  codes  decoded  by  Rudolph's 
method.  A  comparison  is  made  for  the  codes  derived  from  projective 
geometries  and  the  codes  from  Euclidean  geometries  by  u-step  decoding 
method.  For  the  same  error -correcting  ability,  the  transmission  rate  in¬ 
creases  as  code  length  increases  but  the  decoder  complexity  also  increases. 

4.1  Non -Orthogonality  of  Some  BCH  Cedes 

Massey  [l3j  in  his  earlier  work  suggested  an  important  area  of  re  - 
search  to  be  investigation  of  L-step  orthogon&lization  procedure  for  block 
linear  codes.  An  interesting  result  is  obtained  in  this  direction.  That  is, 
some  double  error -correcting  BCH  codes  cannot  be  L-step  orthogonalized. 

The  proof  essentially  consists  of  showing  no  set  of  d-1  (where  d  is  the  mirimum 
distance  of  the  code)  parity  checks  orthogonal  on  any  noise  bit  or  sum  oi  noise 
bits,  can  be  formed.  We  first  represent  a  necessary  condition  for  a  code  to 
be  L-step  orthogonalized  as  follows. 

Lemma  4.  1  Let  gQ(x)  be  the  generator  polynomial  of  a  binary  code  CQ 
and  gQ(l)  f**  0.  Let  be  an  extension  code  o*  CQ  obtained  by  adding  an 

overall  parity  check  as  its  first  digit  to  CQ.  If  is  invariant  under  a 
transitive  permutation  group,  a  necessary  condition  for  Co  to  be  L-step 
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orthogonalized  is  that 


>  <4 


d  -1 


)d' 


(4. 1) 


where  n  is  the  code  length,  d  is  the  minimum  distance  of  the  code  Cq.  d' 

is  the  minimum  distance  of  the  dual  code  C  of  C  . 

o 


Proof:  Let  x, , 


1’  x2’ 


xd  -1  <do  is  an  odd  ^nte8^r  no  more  than  d)  be 


the  set  of  vectors  in  the  code  C  which  are  used  to  form  a  set  of  dQ-l  parity 

checks  orthogonal  on  a  selected  sum  of  noise  bits  e.  +  e.  +  ...+e.  where 

1  2  y 

e.  are  distinct  noise  bits  (1  <  i  <  n).  Let  x  be  a  vector  which  is  a 
i  -  y  --  o 

sum  of  Xy  Xy  ....  x^  and  the  vector  with  all  one  entries.  Let 

o 

w.  (0  <  i  <  dQ-l)  be  the  weight  or  the  number  of  l's  of  the  vectors  x. 
respectively.  Since  dQ-l  is  even,  xq  must  have  ones  in  the  positions 
iy  i2>  i  •  It  is  easy  to  verify  that 


d  -1 
o 


2 

i=0 


(w.  -  y)  =  n  -  y 


(4.2) 


Without  loss  of  generality,  let 


wt  <  w2  <  — -  <  w 

o 


(4.3) 


From  equation  (4.2)  and  (4.3),  we  have 


wo+(do-1)wl  -n 
- - 

o 


(4.4) 


We  now  want  to  show  that 


w  +w  ■  (d'  -  1) 
o  1  v  ' 


2 


(4.5) 
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and 


w  <  n-d* 
o  — 


(4.6) 


in  order  to  prove  this  lemma.  Let  be  a  code  generated  by  (x-1)  gQ(x) 
where  g0(x)  is  the  generator  polynomial  of  CQ.  Let  C’  be  the  dual  of  Cq. 

C'  is  a  cede  contains  the  code  C  as  a  subcode.  C'  contains  the  vector  with 
all  one  entries,  hence  C’  contains  xQand  xq  +  x^  as  code  words.  The 
weight  of  xq  +  is  w0  +  w^  -  2y.  Equations  (4.  5)  can  be  established  if 
we  show  that  the  minimum  distance  of  the  code  C'  is  d’  -1.  C  is  a  subcode 
of  C'  and  C  contains  all  the  code  words  of  C'  which  have  even  weight.  It  is 
easy  to  verify  that  the  extension  code  <T  of  C’  obtained  by  adding  an  overall 
parity  check  as  its  first  digit  to  the  code  C'  is  a  dual  code  of  C  .  C 

v)v  wC 

is  a  c'vie  which  is  invariant  under  a  transitive  permutation  group  implies 
that  its  dual  code  also  invariant  under  the  same  transitive  permutation  group 
(c.f.  Theorem  11.  1  of  reference  [l4]  ).  d'  is  the  minimum  distance  of  C, 
then  d'  is  an  even  integer  and  d'  is  also  the  minimum  distance  of  C\  Since 
(T  is  invarient  under  a  transitive  group,  there  exists  a  vector  in  (T  such 
that  the  first  digit  of  vg  is  nonzero  and  the  weight  of  ve  is  d’.  The  vector 
v  obtained  by  deleting  the  first  digit  of  ve  has  weight  equal  to  d’  -l  and  v  is 
a  code  word  of  C  with  minimum  weight  d'-l.  Hence  equation  (4.  5)  can  be 
established.  The  weight  of  xq  must  be  odd.  Since  the  minimum  even 
distance  of  C’  is  d\  ’he  largest  possible  weight  of  xq  is  n-d'.  Hence 
equation  (4.6)  is  established. 

From  equations  (4.  4)  and  (4.  5) 


w  -t-  w, 
o  1 


(d’-l) 


wo  +  (do-  Dw^n 

a~7i 

o 


(4.7) 
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Rearrange  equation  (4.7),  we  have 


1  v  W1 

T^l)  V-  ~T 


d'-l 


d  -1 
o 


(4.  8) 


w  >  d’ 

Substituting  equations  (4. 6)  and  (4.  9)  into  equation  (4.8),  we  have 


(4.9) 


/  1  1  V  /_  ^  d’  n  ,  d'-l 

(7  ’  (n*d)  >  -y  -  s~zi  +  ~r 

o  o 


(4. 10) 


Rearranging  equation  (4. 10).  we  have 

1 


n+i  /  3 
— >  (y 


T^l 

o 


)  d* 


(4.11) 


From  the  property  that  C’  is  invariant  under  a  transitive  pei  mutation  group, 
we  have  shown  the  minimum  distance  of  C’  is  odd  (d’-l  is  odd).  Similarly, 
we  can  show  d  is  an  odd  integer  since  is  invariant  under  a  transitive 
permutation  group.  dQ  must  equal  to  d  when  CQ  can  be  L-step  orthogonalized. 
The  lemma  is  proved  by  equation  (4.  11). 

From  equation  (4.  11) 


J 


do’1  1 


2d’ 

3d' -n  - 1 


(4.  12) 


The  maximum  number  of  parity  checks  orthogonal  on  any  noise  bit  or 
sum  of  noise  bits  is  no  more  thanj. 

Let  a  be  a  primitive  root  of  GF(2m).  A  binary  NBCH  code  CQ  is 
defined  to  be  the  code  consists  of  a  \  a  ^ ,  ....  a  d  *  consecutive  roots. 

The  extension  code  obtained  by  adding  an  overall  parity  check  to  CQ 

as  its  first  digit  is  invariant  under  a  transitive  group.  Lemma  4. 1  is 
applicable  to  this  code. 
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Theorem  4. 1  All  double  error -correcting  binary  NBCH  codes  CQ 
cannot  be  L-step  orthogonalized  for  m  >  7. 

Proof:  The  minimum  distance  of  these  code  is  at  least  5.  Let  dQ  =  5, 
and  using  equation  (4. 11),  we  have 


(2m  -  1)  +1  3 

- 2 - -  <  T 


5-1 


)d’ 


(4. 13) 


or 


2m+1  >  5d' 


For  m  to  be  odd,  d'  is  equal  to  2m  1  -  1  [8]  .  Equation  (4.  13) 

becomes 


2m+l  >  g ^m-l  -  2^m+  ) 

or  ^>2(m+l)/2-l  >  2m_1 

This  condition  cannot  be  satisfied  for  m  greater  than  or  equal  to  7. 

For  m  to  be  even,  d'  is  equal  to  2m  1  -  2^m+2^2  Equation 
(4. 13)  becomes  2m  +  1  >  5  (2m-1  -  2(m  +  2)/2_1  )  or  5  •  2m/2>  2m’1. 

This  condition  cannot  be  satisfied  for  m  greater  than  or  equal  to  8. 

Thus  we  have  proved  the  theorem. 

In  this  theorem,  we  have  shown  that  all  double  error -correcting  NBCH 
codes  CQ  cannot  be  L-step  orthogonalized  for  m  >  7.  In  next,  we  show 
that  the  binary  double  error -correcting  NBCH  code  Cq  cannot  be  L-step 
orthogonalized  for  m  equal  to  5.  The  dual  C  of  CQ  has  vectors  of  the 
following  four  possible  weights  0,  12,  16,  20  [8]  .  We  use  same  notation  as 
in  the  previous  lemma  and  theorem.  A  vector  in  the  code  C  but  not  in  C 
has  weight  equal  to  11,  15,  19,  31. 
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(wQ-y)  +  (w1-  y)  + (w2- y)  + (w3-y)+ (w4- y)  =  n-y  (4.14) 

(wQ-  y)  +  (Wi-y)  >11  (4.15) 

(Wj-  y)  +  (w2~  y)  >  12  (4.16) 

Let  Wj<w2  <w3  <w^  ,  equation  (4. 14)  implies 

y  >  (wq  +  w  j  +  3  w2  -  31)  /4  (4. 17) 

Equations  (4.  15)  and  (4. 16)  imply  that 

y  <  (w0  +  w!  -  1X)  /2  (4* 18) 

y  f(Wj  +  w0  -  12)  /2  (4. 19) 


With  the  restrictions  that  Wj  <  w2>  w.  (i=l,2)  must  equal  12,  16,  or 

20,  wq  must  equal  to  11,  ’5,  or  19,  one  can  verify  that  there  does  not  exist 

a  positive  integer  y  such  that  equations  (4. 17),  (4.  18)  and  (4.  19)  are  satisfied 
simultaneously.  This  shows  that  Cq  cannot  be  L-step  orthogonalized  for 
m  “5- 

4.2  One -Step  Majority  Decoding  of  Some  Cyclic  Codes 

In  this  section,  we  show  a  method  to  decode  all  cyclic  codes  whose 
extension  codes  are  invariant  under  a  doubly  transitive  permutation  group 
and  also  to  decode  the  cyclic  codes  which  have  an  addition  parity  check  bit 
than  the  previous  ones. 

m  s 

Let  CQ  be  a  q-ary  code  with  length  n=q  -1  and  let  q“p  where  p 

is  a  prime  number.  The  extended  code  C  of  C  is  a  code  with  an  overall 

r  oe  o 

parity  check  to  CQ  as  its  first  digit.  The  first  position  of  a  code  vector  in 

2 

is  numbered  0,  the  i-th  position  for  i  >  1  is  numbered  a1  where  a 
is  a  primitive  element  in  GF(qm).  Thus  the  qm  positions  of  a  code  vector 
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is  numbered  by  qm  elements  in  GF(qm).  An  affine  transformation  with 
parameters  a,  b  belong  GF(qm),  a  ^  0  is  a  permutation  which  carries  the 
symbols  in  position  X  to  position  aX  +  b.  Such  a  transformation  can  be 
applied  to  any  extended  code  associated  with  a  primitive  element  of  GF(qm). 
A  code  will  be  called  invariant  under  the  affine  group  if  every  affine  per¬ 
mutation  carries  every  code  word  into  another  code  word  The  necessary 
and  sufficient  condition  for  a  code  to  be  invariant  under  the  affine  group  is  as 
follows  [6)  . 

Let  g(x)  be  the  generator  polynomial  of  CQ.  Let  i  be  a  positive 
integer  less  than  qm.  Let  J(i)  be  the  set  of  nonzero  integers  j  such  that 
each  coefficient  of  the  p-ary  representation  of  i  is  greater  thaw  or  equal  to 
the  corresponding  one  of  j.  The  extended  code  is  invariant  under  the 
affine  group  of  permutations  if  and  only  if  for  every  a  1  which  is  a  root  of 
the  generator  polynomial  g<x),  for  every  j  belongs  J(i),  a J  is  also  a  root 
of  g(x)  and  g<l)  ^  0.  Let  C  be  the  dual  of  CQ.  Let  h^x)  be  the  generator 
polynomial  of  C.  Let  h*(x)  be  the  reciprocal  polynomial  of  h(x).  Then 

m  , 

h*(x)  g(x)  *  xq  '  -1 

(x-1)  is  not  a  factor  of  g(x)  implies  that  x-1  is  a  factor  of  h(x).  Let  C'  be 
a  code  generated  by  h(x)  /  (x-1),  then  the  extension  code  is  invariant 
under  the  affine  group  of  transformations. 

We  are  concerned  with  the  decodit^  erf  the  codes  C  and  C*.  Kasami 
et  al[?]  have  shown  the  connection  between  binary  codes  which  are  invariant 
under  a  doubly  transitive  group  and  their  connection  with  balanced  incomplete 
block  design.  Rudolph  has  decoded  projective  geometry  codes  by  using  the 
property  of  balanced  incomplete  block  design.  The  argument  to  prove  the 
following  Lemma  is  similar  to  their  argument. 
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Lemma  4. 2  Let  n  be  the  code  length  of  the  code  C.  Let  k1  be  the  minimum 
weight  of  a  vector  in  C  where  C  is  an  extension  code  of  C  which  is  a 
dual  of  C.  C  can  be  decoded  by  one  step  decoding  to  the  distance  of 
[n/  (kj-  1)]+  1. 

Proof:  Let  v  be  a  vector  of  minimum  weight  in  C  .  Let  E  be  the 
equivalence  class  of  v  under  the  doubly  transitive  permutation  group.  Let 
A  be  a  matrix  whose  rows  are  vectors  in  E.  Since  the  permutation  group 
is  doubly  transitive,  there  exists  a  permutation  which  will  permute  i-th 
column  of  A  to  the  ij  -th  column  of  A  and  the  j  -th  column  of  A  to  the  -th 
column  of  A.  Since  the  permutation  leaves  the  rows  of  A  invariant  except 
it  rearranges  the  rows  of  A,  it  follows  that  the  numbers  of  nonzero 
entries  in  the  i^ -th  column  and  i-th  column  are  equal.  The  number  of 
nonzero  entries  in  corresponding  positions  i  and  j  columns  are  equal  to  the 
nonzero  entries  in  ij  and  columns.  Let  the  number  of  nonzero  elements 
in  any  column  be  r  and  the  number  of  nonzero  elements  in  corresponding 
positions  of  any  two  columns  be  *.  Let  A ^  be  the  matrix  obtained  by 
deleting  the  first  column  of  A.  Let  A9  be  the  matrix  obtained  by  deleting 
all  rows  of  A^  whose  leftmost  entries  are  zero.  Aj  is  a  r  by  u  matrix 
with  all  entries  in  the  first  column  nonzero  and  exactly  A  number  of  nonzero 
entries  in  any  other  column.  The  row  vectors  of  A^  are  parity  checks  of 
the  code  C.  Hence  C  can  be  decoded  to  the  distance  [r.Aj+  1.  If  we 
change  all  the  nonzero  entries  of  the  matrix  A  into  1,  the  new  matrix  is  an 
incidence  matrix  of  a  balance  incomplete  block  design. 

Thus 

h/  (x1  -1)  =  x/K 


We  have  proved  the  lemma. 
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In  next,  we  show  how  to  decode  a  code  C’  which  has  one  more 
information  digit  than  that  of  C. 

Lemma  4. 3  Let  C'  be  a  cyclic  code  whose  generator  polynomial  is 
h(x)  /  (x-1)  where  h(x)  is  the  generator  polynomial  of  the  code  C  in  lemma 
4. 2.  C'  can  be  decoded  to  the  distance  ^/(k^-  1)J  by  one  step  decoding 
method. 

Proof-  Let  the  dual  of  C'  be  C1  ,  then  C'  must  be  subcode  of  C  .  We  want 

o  o  o 

to  sb.w  that  the  row  vector  of  the  matrix  (defined  in  the  proof  of  lemma 

4.2)  is  in  the  code  if  and  only  if  the  overall  parity  check  bit  added  to  this 

row  vector  is  equal  to  zero.  The  total  number  of  such  vectors  is  r  -  * 

because  the  number  of  nonzero  elements  in  corresponding  positions  of  t  ie 

first  two  columns  of  the  matrix  A  is  A.  Consider  the  (r  -l)xn  submatrix 

of  Aj  whose  rows  consist  of  these  r  -  *  vectors,  all  entries  in  the  first 

column  is  nonzero  and  at  most  number  of  nonzero  entries  in  any  other 

column.  We  can  decode  O’  to  the  distance  [n/Ck^  *  l)j  by  the  same 

argument  of  lemma  4.  2  provided  all  row  vectors  of  the  .submatrix  are  in  Ch 

We  prove  this  as  follows.  The  generator  matrix  of  C'  had  the  form 

o 

e  e. 

1  o  1  •  -  •  ( a  1  )° 


i 

1  a 

- 


e. 

/  1  \n 

(o  ) 


(4  20) 


where  e.  are  positive  integers. 

The  generator  matrix  of  must  equal  to 


1  1  1  .  .  1 

e  e. 

M2  ■  0  1  a  1  .  .  ( a  )n 


(4.21) 


e  e 

rv  •  l  /  1  iH 

01  a  .  ,  (a  ) 

- 
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because  the  generator  matrix  of  CQ  is  the  matrix  obtained  by  deleting  the 

first  column  of  the  matrix  M2  and  the  element  in  the  first  column  is  indeed 

an  overall  parity  check.  Any  vector  v  in  C  whose  f’rst  digit  is  equal  to 

oc 

zero  can  be  obtained  by  linear  combination  of  the  vectors  from  the  second 
row  to  the  last  row  of  the  matrix  M2  and  vice  versa.  By  deleting  the  first 
row  and  the  first  column  of  the  matrix  M2,  we  can  obtain  the  matrix  Mj. 

The  row  vector  of  is  in  the  code  if  and  only  if  the  overall  parity  check 
bit  added  to  this  row  vector  is  equal  to  zero.  This  proves  the  lemma. 

The  obvious  reason  to  choose  the  vector  v  of  the  minimum  weight 
kj  in  is  that  we  want  the  integer  [n/(k^-  1)J  to  be  as  large  as  possible 
for  a  fixed  n. 

The  extension  codes  of  NBCH  codes  are  known  to  be  invariant  under 
the  doubly  transitive  affine  group.  The  minimum  distances  of  some  binary 
NBCH  codes  are  known  from  references  [l5]  ,  we  can  apply  lemmas  4.  2  and 
4.  3  to  these  codes.  Let  n,  k,  denote  the  code  length,  the  number  of 
information  digits  and  the  guaranteed  decodable  distance  by  one  step  decoding 
method.  Some  binary  BCH  code  with  parameters  (n,  k,  d^  )  are  listed  in 
table  4.  1. 

Recall  that  the  Euclidean  geometry  code  C  with  parameters  m,  u,  q 
is  defined  as  the  cyclic  code  whose  generator  polynomial  contains  a  ^  as 
roots  for  h  satisfying  the  condition  0<  w^(  hpJ)  <  (m-u)  (q-1)  for  0  j  <  s-1 
wla  re  a  is  a  primitive  element  of  GF(q  )  and  q  »  p  .  The  dual  CQ  of  the 

Euclidean  geometry  code  C  has  the  generator  polynomial  contains 


a  "  as  roots  for  h  satisfying  the  condition  0<  w^(h)  <  u(q-l).  The 
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Table  4. 1  Binary  BCH  Codes  by  One  Step  Decoding  Method 


n, 

k, 

dl 

n. 

k, 

di 

31 

10 

7 

31 

11 

6 

63 

9 

22 

63 

10 

21 

63 

15 

13 

63 

16 

12 

63 

23 

10 

63 

24 

9 

63 

45 

5 

63 

46 

4 

127 

14 

26 

127 

15 

2. 

255 

12 

86 

255 

13 

85 

255 

16 

52 

755 

17 

51 

255 

20 

52 

255 

21 

l  \ 

511 

18 

103 

511 

19 

102 

1023 

15 

342 

1023 

16 

341 

1023 

20 

205 

1023 

21 

204 

extension  code  of  the  code  CQ  satisfies  the  n<  ^essary  and  sufficient 

condition  for  a  code  to  be  invarient  under  '‘he  dcubly  *ransitive  affine  gr  oup. 

The  contains  all  u -dimensional  flats  of  EG(m,q).  The  weight  of  a 

vector  v  associated  with  an  u -dimensional  flat  is  q  .  The  q -ary  (m-uXq~l)-th 

order  Reed-Muller  code  contains  q-ary  code  C  as  a  subcode.  The  mini- 

oe 

mum  distance  of  the  q-ary  (m-u)  (q-l)-th  order  Reed-Muller  code  is  qU[b]  - 
Thus  the  minimum  weight  of  tne  vector  v  in  is  qu.  We  can  apply 
lemma  4.2  to  Euclidean  geometry  code  C  and  the  guaranteed  decodable 
distance  is 

di  =[(qm-D/(qu-i>]*i  (4.22) 

The  modified  Euclidean  geometry  cede  has  one  more  information 
digit  than  the  corresponding  Euclidean  geometry  code.  The  guaranteed 
decodable  distance  is  [{qm-  1)  /  (qU-  1)J  by  lemma  4.3. 
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4. 3  Comparisons  and  Remarks 

The  guaranteed  decodable  distance  of  Euclidean  geometry  code  C  by 
one -step  decoding  is  dj  =j(qm- 1)  /  (qu- 1)  J  +  1  which  is  less  than  or 
equal  to  the  guaranteed  decodable  distance  d  by  u-step  decoding. 


d  «  2  +  q  + _ +  q 


(4.  23) 


because 


m  u-1  m  , 

2  +•  q  + - +qm'U-l+  q--^  -  >  1  +  2__li 

„u-l  -  u  . 

q  -q  Lq  -i 


When  u  is  not  equal  to  one,  d  is  greater  than  d^.  The  decoder  for  one -step 
decoding  method  may  be  simpler  than  the  decoder  for  u-step  decoding  method. 
For  example,  when  u  equal  to  m-1  the  guaranteed  decodable  distance  of 
Euclidean  geometry  code  C  is  1  +  q  for  one  -step  decoding  method  but  2  +  q  for 
(m-1) -step  orthogonalization  procedure.  The  decoder  for  the  former  method 
is  simpler  than  the  decoder  for  the  latter  method.  The  guaranteed  decodable 
distances  for  a  Euclidean  geometry  code  and  a  projective  geometry  code  with 
same  parameters  are  both  equal  to  d  for  u-step  decoding  method  and  both  equal 
to  dj  for  one -step  decoding  method.  We  list  Euclidean  geometry  codes  and 
projective  geometry  codes  together  for  comparison  purposes  as  Table  4.  2. 

In  this  table,  q,  m,  u  are  the  parameters  of  the  Finite  geometry  codes,  n 
and  k  under  the  column  PG(or  EG)  are  the  code  length  and  the  number  of 
information  digits  of  projective  geometry  code  (or  Euclidean  geometry  code) 
respectively.  The  guaranteed  decodable  distances  d  by  u-step  decoding 
method  and  d^  by  1  step  decoding  method  are  listed  in  the  last  two  columns 


of  table  4.  2. 
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Let  the  code  length,  the  number  of  information  digits  and  the  number  of 
check  digits  of  the  Euclidean  geometry  code  with  parameters  q,  m(m=m'),  u 
be  ne>  kg  and  rg  respectively.  Let  those  of  the  projective  geometry  code  with 
same  parameters  be  np»^p  anc*  rp  respectively.  Let  those  of  the  projective 
geometry  code  with  same  q  and  u  but  m=m'-l  be  n^,  and  r^  respectively. 
From  Table  4.2,  we  observed  that  for  m'-l>  u 


k  =  k  -  k’  -  1 
e  p  p 


(4.24) 


f  t  i  -I 

Since  n£  =  qm  -1,  =  1  +  q  +  . . .  +  qm  and  =*  1  +  q  +  . . .  +  qm  therefore 


n  =  n  -  n'  -  1 
e  p  p 


(4.25) 

From  equations  (4.24)  and  (4.25),  the  parity  checks  of  these  codes  satisfy  the 
relation 


r  =  r  -  r 
e  p  p 


(4.26) 


We  observed  that  the  number  of  parity  check  re  when  m'-l  equal  to  u  is 


,  p+m  -1  «s 
r  =  (  .  ) 

e  '  m 


(4.27) 


These  observations  can  be  explained  as  follows.  The  integer  Tp  is  equal  to 

the  number  of  distinct  elements  in  the  set  A  which  c  insists  of  integers  a  such 

that  A  .  ,  m'+l  . 

0  <^a<  q  -1 

Wq(ap^)®  (q-1)  ,  0<j<s-l,  0<Vj<(m’-u)  (4.28) 

(c.f.  theorem  3.  1) 

The  integer  r^  is  equal  to  die  number  of  distinct  elements  in  the  set  B  which 
consists  of  integers  b  such  that 
0  <b<  qm  -1 

w^(  bp^)  =  (q-1);  0  £j  <  s-1,  0<  <^(m'-l*u)  (4.  2d) 

The  integer  re  is  equal  to  the  number  of  distinct  elements  in  die  set  C 
which  consists  of  integers  c  such  that 
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„  .  „  ^  m'-l 
0  <  c  <  q 

0  <  w  (  c  pr*)<(m'-u)  (q-1),  0  <  j  <  s-1 


(4.30) 


(c.f.  theorem  2. 1) 

Clearly  the  set  B  is  a  subset  of  A.  We  first  show  that 


r  <  r  -  r’ 
e  -  p  p 


(4.31) 


by  showing  for  a  distinct  c  in  the  set  C,  there  corresponds  c  distinct  h  in  the 
set  A  but  not  the  set  B.  Let 


m'-l 

c  -  ^  fr  q*  ,  0  <  h.  <  q-1  (4.32) 

i=0 

be  an  integer  in  C.  Let 

h  =  c  +  h  ,  qm  (4.33) 

m 

where  h  ,  is  as  follo\  s. 
m 

In  case  (1),  w^(c)  is  not  a  multiple  of  (q-1).  Let  hm»  be  a  positive 
integer  less  than  q-1  and  hm»  +  w^  (c)  is  a  multiple  of  (q-1).  In  case  (2), 
w  (c)  is  a  multiple  of  (q-1),  then 

4 


wq  (c  p-*)  =  w.  (q-1)  ,  0<  j  <  s-1,  0<  v^<(m'-u)  (4.34) 

If  v.  is  less  than  (m'-u)  for  all  j,  let  hm<  equal  to  q -1.  If  there  exist  one 

j  such  that  v.  is  equal  to  (m'-u),  let  h  ,  equal  to  zero, 
j  m 

It  is  easy  to  verify  that  the  integer  h  defined  in  equation  (4. 33)  is  in  the 
set  A  but  not  the  set  B.  Two  different  elements  in  the  set  C  will  correspond 
to  two  different  elements  in  the  set  A  but  not  the  set  B.  Hence  equation(4.  31) 
is  established.  For  any  element  h'  is  the  set  A  but  not  the  set  B,  h’  can  be 


written  as 


m 


-I  h;«;  • 
1=0 


0  <  h!  <  q-1 
-  1  —  n 


It  is  easy  to  verify  that 


m'-l 

■  •  Z  o‘ 


i=0 


is  in  the  set  C.  Hence 


r  >  r  -  r' 
e  -  p  p 

Thus  equation  (4. 26)  is  established. 
When  m'  -1  equals  to  u, 

V  1  +  (P+m“’'l)S 

(c.f.  equation  (3.47)  )  and 


(4.35) 


(4.36) 


(4.37) 


(4.38) 


V  =1 

Hence  c  ^ 

We  list  the  transmission  rate  (k/n)  of  some  codes  listed  in  Table  4.  2 
with  same  guaranteed  decodable  distance  d  in  Table  4.3. 

In  table  4.3,  u  is  the  number  of  steps  required  for  decoding.  The 
quantity  in  parenmeses  is  k/n.  For  same  d,  the  transmission  rate  in¬ 
creases.  The  decoder  complexity  increases  as  the  code  length,  the  number 
of  information  digits,  the  number  of  steps  required  for  decoding  increase. 


Table  4.  3  Transmission  Rate  of  Binary  Finite  Geometry  Codes 
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V.  APPLICATION  OF  CODING  THEORY  TO  INFORMATION  RETRIEVAL 
5. 1  Introduction 

An  entry  in  theindex  file  for  the  document  collection  typically  includes 

an  identification  number  for  the  document  together  with  a  list  of  descriptors 

or  attributes  characterizing  that  particular  document.  The  descriptors  are 

commonly  chosen  from  a  dictionary  and  an  upper  bound  is  placed  on  the 

number  of  descriptors  which  may  be  chosen  to  characterize  any  single 

document.  A  "query"  to  such  a  collection  is  again  a  list  of  descriptors 

from  the  dictionary.  A  typical  dictionary  might  contain  a  number  of  N 

3  4 

descriptors  between  10  and  iO  and  the  maximum  number  of  descriptors 
would  normally  fall  between  5  and  10  [9], 

The  information  retrieval  problem  considered  here  may  be  defined 
as  follows:  Given  a  query,  we  wish  to  devise  a  process  by  obtaining  a  list 
of  documents  such  that  each  of  these  has  all  the  descriptors  possessed  by  the 
query.  In  other  to  automate  the  retrieval  process,  it  is  necessary  to  encode 
both  document  and  query  data  in  some  form  suitable  for  automatic  process. 
Two  important  methods  for  doing  this  have  been  pr,  ,  >s 'd.  One  io  ueing 
zero -false -drop  codes  proposed  by  Kautz  and  Singleton  [9].  Another  one 
is  derived  from  algebraic  coding  theory  by  Chien  and  Frazer  [3j.  Encoding 
by  the  former  method  usually  has  longer  digit  representation  than  the  latter 
method.  The  time  required  to  retrieval  is  comparatively  less  by  using 
zero -false -drop  code.  We  now  give  a  brief  summary  of  these  two  codes. 

A  zero -false -drop  code  of  order  t  (ZFD^  )  is  a  set  of  n -digit  binary  code 
words  satisfying  the  property  that  every  sum  of  up  to  t  different  code  words 
logically  include  no  other  code  word  where  the  sum  of  the  n -digit  binary  words 
is  their  digit  by  digit  Boolean  sum.  Let  each  of  the  N  descriptors  in  the 
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dictionary  be  assigned  a  unique  n -digit  binary  code  word  of  ZFD^  code. 

Each  document  is  represented  by  an  n -digit  word  which  is  obtained  by  forming 
the  digit  by  digit  Boolean  sum  of  the  code  words  of  all  of  its  constituent  des¬ 
criptors.  The  query  is  represented  in  identical  fashion.  It  follows  directly 
from  the  property  of  the  ZFD^  code  that  as  long  as  no  more  than  t  des¬ 
criptors  are  associated  with  any  one  document,  the  query  is  logically  included 
in  a  particular  document  word  if  and  only  if  all  of  the  query  descriptors  are 
included  among  the  descriptors  associated  with  the  document.  Thus  ZFD^ 
code  can  be  used  for  information  retrieval  file  and  guarantees  no  false  drop. 

The  method  of  encoding  documents  and  queries  from  the  algebraic 
structure  of  linear  error -correcting  code  are  as  follows: 

Let  be  a  linear  code  with  t -error -correcting  ability.  Let  H  be 

the  parity  check  matrix  of  The  row  space  of  H  is  in  the  null  space  V2 

T  T 

of  Vy  The  syndrome  of  a  vector  v  is  defined  to  be  vH  where  H  is  the 
transpose  of  the  matrix  H.  If  the  code  length  of  is  which  is  greater 
than  N,  the  total  number  of  descriptors  in  the  dictionary,  we  can  represent 
each  descriptor  as  a  column  of  H:  then  each  descriptor  is  represented  by 
binary  n -tuples  where  n  is  the  number  of  the  parity  check  in  the  code.  A 
document  (or  a  query)  represented  by  mod-nvn  linear  combination  of  the 
n-tuples  each  of  which  corresponds  tc  a  constituting  descriptor  of  the  docu 
merit.  A  query  is  represented  in  a  similar  manner.  We  limit  the  maximum 
number  of  descriptors  to  characterize  each  document  to  t.  The  documents 
can  be  represented  un -ambiguously  because  no  two  distinct  linear  combinations 
of  t  or  fewer  columns  of  H  are  equal.  The  retrieval  method  is  based  on  the 
following  argument.  In  an  error -correcting  code,  we  usually  choose  the 
vector  with  minimum  weight  in  the  coset  as  a  coset  leader.  A  coset  leader 
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C  ,  is  said  to  cover  a  cost  leader  C  if  C  ,  contains  1  whenever  C  contains 
d  q  d  q 

1  in  any  digit  position.  (J^  covers  if  and  only  if  w(Cj  +  )  +  w(C^  )  = 

w  (C^)  where  w(x)  =  weight  of  the  coset  leader  x.  We  can  consider  each 
digit  position  of  a  coset  leader  corresponding  to  a  descriptor  in  the  dictionary. 
A  coset  leader  corresponds  to  a  document  if  has  l's  in  the  digit 
positions  corresponding  to  constituting  descriptors  of  the  document,  and  has 


O's  in  all  other  digit  positions.  A  coset  leader  corresponding  to  a  query 
in  similar  way.  A  document  covers  a  query  if  and  only  if  w(C^  +  C  )  + 
w(C^)  -  w(C^).  In  information  retrieval,  we  do  not  have  the  coset  leader 
and  explicitly  available  to  us  for  testing.  Instead  we  have  syndromes 
Sj  and  Sq  of  and  respectively.  From  previous  discussion,  it  is 
clear  that  the  main  computational  problem  in  our  retrieval  process  is  that  of 


determining  whether  f(s^l  =  f(s  .)  -  f(s  )  where  f(s  )  is  defined  to  be 
equal  to  w(x).  In  coding  terminology,  computing  f(sd+<^  )  is  equivalent  to 
finding  the  weight  of  the  coset  leader  from  the  syndrome  of  the  coset. 


Details  of  a  retrieval  method  derived  from  algebraic  BCH  code  decoding  can 
be  found  in  reference  [3]. 


5.2  Zero -False -Drop  Codes  Derived  From  Finite  Geometries 

Two  classes  of  zero -false -drop  codes  can  be  constructed  from  finite 

geometries.  One  is  derived  rom  projective  geometries  and  the  other  one 

is  derived  from  Euclidean  geometries. 

Kautz  and  Singleton  [9]  have  derived  a  bound  for  the  order  of  zero-lalse  - 

drop  code  of  constant  weight  codes.  Let  w  be  the  weight  of  any  code  woru. 

Let  P  be  the  dot  product  of  any  pair  of  code  words.  Let  p  be  the 
r  J  max 

maximum  number  of  such  p's.  If  w  >  tp  +1,  then  any  code  word 

-  ma  x  ’ 

cannot  possibly  be  contained  in  the  sum  of  any  t  other  code  words  since  it 
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overlaps  each  of  these  other  code  words  in  nc  more  than  M  max  positions. 
The  constant  weight  code  has  order  t  as  a  zero  false  drop  code  such  that 

tJilil  (5. 

-  /i 

L  max  J 


where  the  bracket  indicates  "the  integer  part  of". 

The  projective  geometry  of  dimension  m  over  GF(q),  i.e.  PG(m,q) 
has  number  of  points  equal  to 


qm  +  1-l 

n=^FT- 


(5.2) 


Any  u-space  (1  <  u  <  m-1)  of  PG(m,q)  has  number  of  points  equal  to 


w 


u+1  , 
Q _ ± 

q  i 


(5.3) 


The  total  number  of  u-spaces  of  FG(m,q)[jll^  is 


u 

I  /  nt  r  i  a  v 

I  I  (q  -q  ) 


m+ 1  i 


N  - 


i=0 


=  N  (u,  m,  q; 


(5.4) 


TT  <qU+1-qj) 

1=0 


Let  S  be  a  matrix  whose  rows  correspond  to  the  u-space 
(1  <  u  <  m-1)  of  PG(m,q).  The  matrix  S  has  N  rows,  n  columns  and  w 
ones  per  row.  We  can  regard  each  row  of  S  as  a  code  word  of  a  zero -false  - 
drop  code.  This  is  a  constant  weight  code.  The  intersection  of  two  u- 
spaces  is  a  space  of  dimension  u-1  or  less.  Hence 


max 


1  +  q  + 


+  q 


u-1 


(5.5) 
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From  equation  (5.  4), 


>  |  w  ~ 1  1  J  i  +  g  +  •  •  •  +  qu-i  _ 

L  **  max  -1  L  1  +  q  +  . . !  +  q^  ^  - 


(5.6) 


We  now  show  that  t  is  equal  to  q.  Let  v  be  a  code  word .  v  corre  - 

^  M  u  u 

sponds  to  an  u-space.  Total  number  of  (u-l)-spaces  contained  in  this  u- 
space  orthogonal  on  a  particular  (u -2) -space  is 


(1  +  q  +  . . .  +  qU  )  -  (1  +  q  . .  +  qU  2) 

(1  +q+...  +■  qU_1)  -(l  +  q  +  ...  +qU'2) 


=  1  +q 


(5.7) 


Let  be  a  code  word  not  equal  to  v  and  v  ^  contains  i-th 
(1  <  i  <  1  +  q)  of  these  (u  -  l)-spaces  which  are  contained  in  .v  .  The  code 
word  is  logically  included  in  1  +  q  different  code  words  v^  (1  <  i  <  1  +  q) 
Hence 

t=q  (5.8) 

We  obtain  ZFD^  codes  for  any  power  of  prime  q.  There  exists  m-1 
different  codes  for  a  fixed  m  and  q.  The  code  lengths  n  for  these  codes  are 
equal.  Among  these  codes,  the  best  one  is  the  code  with  maximum  number 
of  code  words  N.  From  equation  (5.4),  the  maximum  N  occurs  when 


-Df] 


(5.9) 


Similarly,  we  can  take  S  to  be  a  matrix  whose  rows  correspond  to 
u -dimensional  flats  of  the  Euclidean  geometry  of  dimension  m  over  GF(q), 
or  EG(m,q).  The  number  of  rows  N,  the  number  of  columns  n,  the  number 
of  ones  per  row  will  take  the  following  values  £llj  . 


n  =  qm 

(5.  10) 

N  =  N  (u,  m,  q)  -  N(u,  m  -1,  q) 

(5. 11) 

u 

w  **  q 

(5.  121 

r 


Two  u-dimensional  flats  EG(m,q)  intersect  an  u '-dimensional  flat 
( 0  <  u'  <  u)  or  do  not  intersect  at  all.  Hence 


/i  =  q 

max 


(5.13) 


L  max  J  L  q  J 

dl  ^  ^  m 

Let  a  ,  a  a  be  linearly  independent  over  GF(q  ) 

Let  be  a  vector  corresponding  to  an  u-dimensional  flat  consists  of  qu 


(5.  14) 


points 


d,  cL  d 

1  ,  z  ,  ,  u 

a^  a  +  a ^  a  +....+  au  a 


(5.  15) 


where  a.(  1  <  i  <  u)  runs  independently  over  GF(q). 

1  do  () 

Let  a  0  be  a  point  not  in  this  EG(m,q).  Let  v^1'  (1  <  i  <  q)  be 

vectors  corresponding  to  u-dimensional  flats  each  of  which  consists  of  points 


d  d,  d  ,  d 

o  ,  1  ,  ,  u-1  ,  u 

a  a  +  a ,  a  -f  . .  .  +  a  ,  a  +q.q 
o  1  u-1  1 


(5.  16) 


where  a.  (  0  <  i  <  u-1)  runs  independently  over  GF(q)and  q^,  q2»-*-.cl( 
are  distinct  elements  in  GF(q). 


The  code  word  v  is  logically  included  in  the  sum  of  these  code  words. 


Hence 


t  =q  -  1 


(5.  17) 


There  exists  m-1  different  ZFD  .  codes  for  a  fixed  m  and  q.  The 

q-1 

code  length  n  and  the  order  of  superimposed  code  t  (t  *  q-1)  is  fixed.  Among 
these  codes,  the  best  one  is  the  code  such  that  the  number  of  code  words  N  is 


the  largest. 


We  list  some  zero -false -drop  codes  derived  from 


finite  geometries  in  Table  5.  1. 


hi 


Table  5.  1  Zero -False -Drop  Codes  Derived  From  Finite  Geometries 
Projective  Geometry  Zero -False -Drop  Codes 


LnJ 

35 

155 

1395 

11811 

97155 

130 

1210 

33880 

31 

63 

255 

40 

121 

364 

2 

2 

■ 

3 

3 

_ 

3 

N  .  357  .  5797 

806 

2850  4745 

n  85  341 

156 

400  ,  585 

1 

t  4  |  4 

5 

Ti  r 

Euclidean  Geometry  Zero-False -Drop  Codes 

N 

117 

_ 

1080 

32670 

336 

5440 

2793 

4672 

512 

7 

ID 

81 

243 

64 

256 

125 

343 

2 

2 

3 

3 

•*r 

_ , _ 

6 

5.3  A  Method  for  Encoding  and  Retrieval  of  Documents 

A  method  of  encoding  and  retrieval  of  documents  derived  from 
algebraic  coding  theory  is  introduced  in  the  first  section.  A  new  method 
also  derived  from  algebraic  coding  theory  will  be  presented  in  this  section 
A  comparison  will  be  mack*  with  the  previous  method.  Let  n  be  the  code 
length  of  a  t -error -correcting  BCH  code.  Let  a  lx-  the  primitive  n-ih 
root  of  unity.  If  the  total  number  of  the  descriptors  in  the  dictionary  is 
less  than  n.  we  can  represent  the  j-th  descriptors  by  a  *  * 


If  a  devilment 
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has  number  of  descriptors  ,  X^,  •  •  ■  ,  X, 


v(d) 

...  ,  A  , 

we  propose  to 

d 

s  ‘’fi-V 

. . .  o  "1  where 

WdJ 

<d)  y(d) 

l  ,  ^2  ,  -  - 

.  If  a 

d 

query  has  w  number  of  descriptors  xj^,  X^;  X^\  the  deter- 

Q  -  wq 

mination  of  whether  a  document  covers  a  query  becomes  to  determine 

w 


whether  n  -  a  x 
W,  Wj  ”1 


+  a  (-x)  d  +  (-l)  d  contains  xj^ 


d  wd 

as  roots.  The  hardware  required  to  realize  this  decision  is  considerably 
simpler  than  BCH  code  decoder  which  is  required  for  retrieval  if  the  docu¬ 
ment  is  represented  by  the  syndrome.  The  hardward  required  to  encode 
cr  is  no  more  complicated  than  the  hardware  required  to  encode  s^.  An 
information  system  described  in  reference  DO  utilizes  the  variable  length 
coding  to  minimize  system  requirements  in  both  storage  and  computation. 

The  new  representation  of  document  is  also  a  variable  length  scheme,  and 
the  storage  required  is  minimum.  The  computation  seems  to  be  simpler 
than  to  determine  the  weight  of  coset  leader  from  the  syndrome  of  a  document 
plus  a  query. 


5.4  On  the  Use  of  Finite  Geometry  Codes  by  Chien’s  Formulation 

By  using  Chien's  formulation  [3]  ,  the  main  task  to  determine  whether 
a  document  covers  a  query  is  equivalent  to  find  the  weight  of  coset  leader 
from  the  syndrome.  For  the  codes  constructed  from  finite  geometries, 

Reed  Decoding  Algorithm  can  be  applied.  The  determination  of  weight  of 
coset  leader  from  syndrome  if  codes  constructed  from  finite  geometries 
instead  of  BCH  codes  are  used.  In  general,  the  efficiency  of  BCH  code  is 
higher  than  that  of  finite  geometry  code  but  the  difference  is  slight  in  many 
cases  including  difference -set  cyclic  codes  [l7]  .  We  can  apply  these  codes 
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constructed  from  finite  geometries  for  information  retrieval.  We  first 
introduce  Reed  Decoding  Algorithm  and  show  that,  in  some  cases,  the 
determination  of  the  weight  of  coset  leader  from  syndrome  is  simpler  than 
using  Reed  Decoding  Algorithm. 

Let  L  be  a  nxb  matrix  of  l’s  and  0's  whose  b  columns  are  elements  of 

the  null  space  V2  of  an  (n,  k)  binary  code  V^,.  Any  vector  of  the  null  space  is 

essentially  a  parity  check  rule  satisfied  by  code  vectors.  For  any  received 

vector  v,  vL  is  a  vector  of  b  l's  and  0's  which  contains  l’s  in  the  positions 

corresponding  parity  check  rules  that  v  fails  to  satisfy.  Now  consider  the 

T 

result  of  mbltiplying  (vL)  by  L  as  real  numbers.  The  result  will  be  a 
vector  of  n  components  that  are  integers. 

e(v)  =  (vL)  LT  (5. 18) 

There  will  be  a  contribution  of  1  in  the  j-th  component  of  e  for  each 
column  of  L  which  fails  as  a  parity  check  and  contain  a  1  in  its  j-th  position. 
Thus  the  j  -th  component  of  e  is  the  number  of  failures  of  parity  checks  that 
involve  the  j  -th  symbol  in  the  code  vectors.  Let  e  (v)  be  a  vector  whose  j  ~th 
component  is  1  if  the  j-th  component  e(v)  exceed  certain  threshold  and  0 
otherwise,  e(v)  is  considered  to  be  the  error  vector.  The  number  of  l's 
in  e(v)  is  the  number  of  errors.  For  a  projective  geometry  code  associated 
with  u-spaces  of  PG(m,q),  let  L  be  the  matrix  whose  columns  corresponding 
to  u-spaces.  For  an  Euclidean  geometry  code  associated  with  u-dimensional 
flats  of  EG(m,q),  let  L  be  the  matrix  whose  columns  corresponding  to  the 
u-dimensional  flats  with  first  digit  deleted.  If  we  let  t  equal  to 
[(qm-l)  /2<qU-l)]  and  set  threshold  at  r/2  where  r  is  the  number  of  l’s 
in  the  row  of  L,  e(v)  is  the  error  vector  provided  the  number  of  errors  oc¬ 
curred  is  no  more  than  t.  In  information  retrieval  system,  let  the  syndrome 
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of  a  document  plus  a  query  be  sd+q,  we  actually  compute  e(sd+q)  =  sd+qMLT 

T 

Instead  of  e(v)  =  vL  L  where  M  is  a  matrix  such  that  vL  =  s  , .  M. 

d-r  q 

In  some  cases,  we  need  to  compute  sd  +  q  M  only  in  order  to  distinguish 

the  weight  of  coset  leader  from  the  syndrome,  the  computation  of  sd+qM  is 

T 

much  simpler  than  the  compute'. tion  of  sd+q  ML  . 

We  demonstrate  this  as  follows:  Let  L  be  the  matrix  whose  columns 
are  u -spaces  of  PG(m,  2s).  Let  v  be  the  coset  leader  corresponding  to 
sd+q,  then  vL  =  sd+^M.  If  for  distinct  weight  of  v,  the  number  of  l’s 
in  sd+qM  is  distinct,  the  computation  of  sd+qM  is  sufficient  to  determine 
the  weight  of  coset  leader  from  the  syndrome.  We  now  show  that  the  weight 
of  sd+qMis  distinct  for  sd+q  corresponding  to  the  coset  of  weight  0, 1, 2, 
or  3  for  t  greater  than  or  equal  to  three.  The  computation  sd  .  M  is 
sufficient  to  determine  the  weight  of  coset  leader  from  the  syndrome  if  the 
code  is  a  triple  error -correcting  code. 

Let  w(x)  denote  the  weight  of  a  vector  x.  We  have 


w(sd+qM)  =  0  when  w(v)  -  0 
The  number  of  l's  in  any  row  of  L  is  r  where 


(5. 19) 


(2is-l) 


r  =  i-(m-u+l) 

fr  (2j  s  - 1) 

j=l 


(5.20) 


w(Sd+qM)  =  r  when  w(v)  =  1 
The  number  of  pairs  both  with  l's  in  any  two  rows  of  L  is  ^  where 


(5.21) 


ft 


\  *  . 
tf"  <2js-i) 

j=i 


(21S-1) 


S3 


(5.22) 
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When  weight  of  v  is  equal  to  two,  the  two  nonzero  digit  positions 

dl  <*2  s 

correspond  to  points  a  ,  a  of  PG(m,  2  ).  The  number  of  u -spaces 
contains  any  one  point  is  r  and  contains  any  two  points  is  A.  A  parity 
check  corresponding  to  an  u -space  fails  if  this  u -space  contains  one  and  only 

dl  d2 

one  of  the  points  a  ,  a  The  number  of  u -spaces  containing  cr 
d2 

and  not  a  is  r  -  A.  Thus 


w(sd+qM)  =2(r-A)  when  w(v)  =  - 


(5.23) 


When  weight  of  v  is  equal  to  three,  the  three  nonzero  digit  positions 

dl  d2  d3  s 

correspond  to  three  points  a  ,  a  and  a  of  FG(m,  2  ).  Two 
cases  possible.  In  case  (1),  these  three  points  are  linearly  dependent.  If 
an  u -space  contains  two  of  these  three  points,  then  it  must  contain  the  third 
point.  Total  number  of  u -spaces  containing  all  three  points  is  A.  Thus 


w(sd+qM)  =3(r-A)  +  A  =  3r  -  2A 


(5.24) 


In  case  (2),  the  three  points  are  linearly  independent.  Let  A^  be  the 
number  of  u -spaces  containing  these  three  points. 


m-2 

TT  <2ls-1> 

i  i=m  -u+ 1 

Ai  ~  T3 — : - 

TT  (2js-d 

j*i 


(5.25) 


The  number  of  u-spaces  containing  a  but  not  a  ^  and  a  ^  is  r  -  2  A  +  A^. 
The  number  of  u-spaces  containing  one  of  the  three  points  or  all  of  the  three 
points  is  3(r-2A  +  A^)  +  A^.  Thus 


w(scj+(.M)  =  3(r  -  2A  +  A^)  +  A^  =  3r  -  6A  -j-  4A^ 


(5.26) 
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t  =[r/2*j  (5.27) 

When  t  is  greater  than  or  equal  to  three 

r  >  6X  (5.28) 

From  equation  (5.28), 

3r  -  6A  +  4*x  >  2r  -  2A  >  r  >  0  (5. 29) 

also 

3r-2X>2r-2X  (5.30) 

Thus  the  weight  of  sd+^M  is  distinct  for  sd+^  corresponding  to  the  coset 

of  weight  0, 1, 2,  or  3. 

5.  5  On  the  Use  of  Symmetry  of  Codes  for  Retrieval 

One  form  of  symmetry  of  a  systematic  code  is  a  permutation  of  bit 
positions  in  each  code  word  (the  same  permutation  is  applied  to  all  code  words) 
which  preserves  the  code  as  a  whole.  The  idea  of  using  symmetry  of  the 
code  for  information  retrieval  is  closely  related  to  the  concept  of  permutation 
decoding  [10]  .  The  permutations  which  leave  the  code  invariant  have  a 
desired  property  for  information  retrieval  purposes.  If  G  is  a  group  of 
permutations  that  leaves  the  code  invariant,  then  G  partitions  cosets  into 
equivalence  classes.  The  coset  leaders  of  the  coset  in  the  same  equivalence 
class  have  the  same  weight.  There  exists  a  group  G ^  isomorphic  to 
G  [l9]  .  Gj  partitions  the  set  of  syndromes  into  orbits.  If  two  syndromes 
in  the  same  orbit  of  G^,  then  these  corresponding  cosets  are  in  the  same 
equivalence  class  of  G.  That  two  syndromes  in  the  same  orbit  implies  that 
their  corresponding  coset  leaders  must  have  the  same  weight.  If  a  t -error - 
correcting  code  can  be  decoded  by  permutation  decoding  and  a  received 
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sequence  has  errors  less  or  equal  to  t,  then  all  the  errors  In  this  received 
sequence  can  be  moved  to  parity  check  portion  by  a  permutation  in  G.  In  an 
equivalence  class  under  G  of  cosets  whose  coset  leaders  are  of  weight  t^ 
which  is  less  or  equal  to  t,  then  there  must  be  a  coset  leader  with  all  its  l's 
in  the  parity  check  portion.  The  syndromes  of  the  coset  leader  having 
information  digit  portion  all  zero  is  identical  to  parity  check  portion  of  this 
coset  leader.  For  any  syndrome  corresponding  coset  of  weight  t^  which  is 
less  or  equal  to  t,  there  exists  a  syndrome  in  the  same  orbit  under  G^  such 
that  the  number  of  l's  in  this  syndrome  is  t^.  It  becomes  clear  at  this 
point  that  a  t -error -correcting  code  which  can  be  decoded  by  permutation 
decoding  can  be  used  for  information  retrieval  purpose.  Recall  that  a  docu¬ 
ment  is  represented  as  a  syndrome  in  Chien's  formulation  and  the  main 
computation  is  to  determine  the  weight  of  coset  leader  from  the  syndrome. 

The  determination  of  whether  a  document  covers  a  query  is  as  follows. 

We  obtain  a  syndrome  sd+q  by  adding  the  syndrome  of  document  and  the 
syndrome  of  query  s^  .  We  determine  the  weight  of  coset  leader  of 
by  generating  the  syndromes  in  the  same  orbit  of  sd+^  under  G^.  If  one 
of  these  syndromes  has  number  of  l's  equal  to  ^  which  is  less  or  equal  to  t, 
the  weight  of  coset  leader  must  be  equal  to  t^.  Otherwise  the  weight  of 
coset  leader  is  larger  than  t.  The  effective  use  of  this  principle  lies  on  a 
method  to  generate  the  syndromes  in  the  same  orbit  without  duplication 
which  will  be  presented  at  the  end  of  this  sec  ion.  The  permutation  decoding 
is  mainly  for  low  rate  codes.  If  a  code  has  minimum  distance  d  equal  to 
2t  +  1,  then  this  code  is  capable  of  correcting  t  errors.  If  we  cannot  move 
all  errors  in  an  error  pattern  with  weight  less  than  or  equal  to  t  to  the  parity 
check  portion  by  the  permutations  which  leave  the  code  invariant,  the 
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permutation  decoding  scheme  does  not  work  for  this  code.  If  we  represent 
the  documents  and  queries  as  syndromes  of  this  code,  there  exists  a  syndrome 
sd_j_  corresponding  a  coset  of  weight  less  than  or  equal  to  t,  but  all  syndromes 
in  the  orbit  of  under  has  number  of  l's  larger  than  t.  In  this 

case,  we  need  some  modification  in  information  retrieval  process  in  order 
to  make  use  of  this  code.  Two  approaches  are  possible.  The  first  approach 
is  as  follows.  We  pick  any  syndrome  in  the  orbit  as  a  representative  if 
syndromes  in  this  orbit  corresponding  coset  leaders  of  weight  less  than  or 
equal  to  t  and  all  of  the  syndromes  have  number  of  l's  greater  than  t.  We 
store  all  representatives  of  the  orbits  having  the  preceding  property  and  the 
corresponding  weights  of  the  coset  leaders.  Given  a  syndrome  sd+(^,  we 
proceed  to  generate  the  syndromes  in  the  same  orbit.  If  one  of  the  syndromes 
has  number  of  l's  less  than  or  equal  to  t,  we  determine  the  weight  of  coset 
leader  immediately.  Otherwise  we  can  do  table  look  up  to  find  its  weight. 

In  this  approach,  we  need  storage  to  store  the  table  which  consists  of  the 
representatives  of  the  orbits  and  their  corresponding  weights.  The  approach 
of  this  method  is  practical  provided  the  table  is  not  very  large.  It  is  possible 
to  save  storage  by  the  following  second  approach.  We  define  a  vector  vc 
covers  a  coset  leader  e  if  the  information  portion  of  v£  agrees  with  any  coset 
leader  e'  in  the  equivalence  class  of  e  under  G.  We  obtain  a  set  of  covering 
vectors  that  cover  every  coset  leader.  We  call  their  corresponding  syndromes 
the  covering  syndromes.  For  any  syndrome  of  a  document  plus  a  query,  we 
can  find  a  syndrome  in  the  same  orbit  and  the  sum  of  this  syndrome  and  a 
covering  syndrome  will  have  the  number  of  l’s  less  than  or  equal  to  t  in  the 
resulting  syndrome.  In  this  case,  we  are  able  to  determine  whether  a 
particular  document  covers  a  given  query.  The  retrieval  process  is  based 
on  this  principle.  In  the  second  approach,  one  covering  syndrome  may 
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cover  several  equivalence  classes.  The  storage  requirement  in  the  second 
approach  is  less  than  that  in  the  first  approach.  In  the  first  approach,  we 
can  arrange  syndromes  having  the  same  weight  together  in  the  table.  We 
need  only  to  check  whether  the  syndrome  in  the  equivalence  class  of 
match  the  syndromes  in  the  section  of  the  table  where  syndromes  have  weight 
equal  to  the  weight  of  document  minus  the  weight  of  query.  In  the  second 
approach,  we  can  arrange  the  covering  syndromes  with  same  weight  together 
in  the  table,  we  need  only  to  check  whether  the  syndromes  in  the  equivalence 
class  of  are  covered  by  the  syndromes  which  have  weight  no  more  than 

the  weight  of  document  minus  the  weight  of  query. 

A  method  to  generate  syndromes  in  the  same  orbit  without  duplication 
is  as  follows.  The  only  known  group  of  permutations  which  leaves  any 
binary  cyclic  code  invariant  is  the  group  Gn  generated  by  cyclic  permutation 
T  and  the  permutation  U  which  maps  w  to  2u  mod  n  where  u  is  coordinate 
number  labeled  as  0, 1,  2, ...  ,  n-1  fl 


If  n  is  odded,  there  exists  a  least 

integer  t  such  that  2l  =  1  mod  n  and  U*  =  I.  It  is  easy  to  check  that  TU  =  UT 
2 

(i.e,  vTU  =  vUT  for  any  vector  v),  hence  we  may  represent  every  permutation 
in  Gn  in  the  form  UlT*  with  0  <  i  <  t-1,  0  <  j  n-1.  Now  every  power 
of  U  leaves  0  fixed;  thus  U*T*  =  U^T^  if  and  only  if  i  =  h  mod  t  and  j  =  k 
mod  n.  Thus  the  group  Gn  consists  of  nt  permutations  UT  for  all  i,  j 
such  that  0  <  i  <  t-1  andO  <  j  <  n-i. 

Gn  partitions  coscts  into  equivalence  classes.  The  weight  preserving 
group  which  is  isomorphis  to  (in  jwrtitions  the  set  of  syndromes  into  orbits. 

We  want  to  generate  the  syndrome  in  the  orbit  without  duplication.  Otherwise 
it  is  wasting  time.  This  -mne  piohlem  to  get  all  coset  leaders  in  an 
equivalence  class  without  ,oo.  Iai  v  lx-  a  coset  leader.  If  the 

equivalence  class  of  v  consist*  of  ni  disMut  elements,  then  all  the  elements 
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in  the  following  t  x  n  matrix  are  the  distinct  coret  leaders  in  the  equivalence 
class. 


vuV 


xn 


(5.31) 


where  0  <  i  <t-l,  0  <j  <  n-1. 

.  If  the  equivalence  class  of  v  does  not  have  nt  distinct  elements,  the 
distinct  elements  can  be  obtained  as  follows. 

Let  e  be  the  smallest  positive  integer  such  that  vT  =  v  I  and  t'  be 
the  smallest  positive  integer  such  that  vUt  =  vT^  for  some  j.  The 
equivalence  class  of  v  containing  the  elements 


vUXTj 


for  0  <i  <t'-l,  0  <  j  <  e  -1  (5.32) 


The  proof  is  as  follows.  The  elements  in  any  row  are  just  cyclic 
shift  of  each  other.  If  t'  is  the  smallest  integer  such  that  vUl  =  vTJ  for 
some  j.  The  elements  in  t'+  1-th  row  of  matrix  (5.31)  are  identical  to  the 
elements  in  the  first  row  of  matrix  (5.  31).  The  elements  in  the  t'  +  2  row 
is  identical  to  the  elements  in  the  second  row  etc.  If  the  row  consists  of 

^1  *2  i 

vU  is  identical  to  the  row  consists  of  vU  where  0<  ij  <  ^  5 1' -1,  then 

*2  *1 

the  row  consists  of  vU  is  identical  to  the  first  row.  This  is  a  contra¬ 
diction  because  i 2  -ij<t'.  Thus  the  first  t’  rows  of  matrix  (5. 31)  are 
the  only  distinct  rows.  We  define  the  period  of  a  vector  v  to  be  the  smallest 
integer  e  such  that  vT  =  vl.  We  need  to  show  that  the  period  of  v  is  equal 
to  the  period  of  vU1. 

Let  v(x)  be  a  polynomial  whose  coefficients  corresponding  to  v  ,  and 

2 

the  period  of  v  bee.  Let  v(x  )  be  a  polynomial  whose  coefficients  corre¬ 
sponding  to  vU  and  the  period  of  vU  be  e'- 


71 


x6v(x)  =  v(x)  mod  xn  -  1  (5.33) 

implies  that 

x6v(x2)  =  x6(v(x)  )2  =  (  xev(x)  )  v(x)  =  v(x)  v(x)  =  v(x2)  mod  xn  1  (5.  34) 
Therefore 

e'  <e  (5.35) 

xe  v(x2)  -  v(x2)  mode  xn- 1  (5.36) 


implies  that 


xe  v(x )  =  x  (v(x  )  )  =  x  (v(x)  )  =  x  (v(x)  )  (v(x)) 


2l  -2 


xe  v(x2)  (v(x)  )2  2  =  v(x2)  (v(x)  )2  2  =  v(x)  mod  x11-  1  (5.37) 


Therefore 

e’  >e  (5.38) 

From  equations  (5.  35)  and  (5. 38) 

e'=e  (5.39) 

We  have  shown  that  e'  *  e  for  i  =  1.  Similar  argument 
will  enable  us  to  prove  that  vU1  and  (vU^U  have  same  period,  hence  v  and  vU1 
have  same  period  for  any  i. 

Thus  we  have  shown  that  the  equivalence  class  of  v  containing  the 
elements  defined  in  equation  (5.32). 


5.  6  Investigation  of  Using  Concatenated  Codes 

In  Chien's  formulation,  we  represent  the  descriptors  by  the  columns  of 
the  parity  check  matrix  of  a  t -error-correcting  code  where  t  is  the  maximum 
number  of  descriptors  allowed  by  any  document.  It  is  easy  to  verify  that  a 
code  Vj  obtained  by  joining  severial  t -error  correcting  codes  is  also  capable 
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of  correcting  t -errors,  therefore  we  can  represent  the  descriptors  by  the 
parity  check  matrix  of  V^.  The  use  of  multiple  copies  of  a  single  code  will 
simplify  the  computational  process  but  on  the  other  hand,  will  require  larger 
addresses.  The  situation  is  roughly  as  follows. 

If  one  BCH  code  with  length  equal  to  2m-  1  is  used,  the  number  of 

m* 

parity  check  digit  is  mt.  Now  if  we  use  2m  number  of  the  same  code  with 

code  length  2m  m  -1.  The  number  of  allowable  descriptors  in  the  dictionary 

is  2m  (2m  m  -1)  which  is  a  little  less  than  2m-  1  when  m  is  much  great  r 

that  m'.  Let  l  equal  to  2m  .  We  will  show  that  the  fractional  amount 

of  work  required  by  using  i  codes  compared  with  that  required  by  using  one 

_2D  >  2 

code  for  each  different  weight  of  documents  is  1  jC^.  where  D  is  the 

k 

weight  of  the  document  and  the  C£s  are  the  coefficients  of  the  terms  in  the 
expansion  of  (x^  +  X2  +  • . .  +  x^.  To  illustrate,  for  D  =  10.  the  fractional 
amount  of  work  by  using  four  copies  is  0.01.  The  detail  derivation  of  the 
formula  and  a  curve  for  showing  the  percentage  amount  of  work  tequired  by 
using  l  BCH  codes  as  con  pared  with  that  of  one  BCH  code  for  different 
weights  of  a  document  is  as  follows. 

Let  D  be  the  weight  of  a  document  and  let  Q  be  the  weight  of  a  query. 

If  D  is  greater  than  Q,  we  can  determine  whether  the  document  covers  the 
query  by  comparing  the  syndromes  corresponding  to  the  cosets  of  weight  D-Q. 
Let  A.  be  the  number  of  cosets  of  weight  1,  then  A.  =  (  ”)  where  n  is  the  code 
length.  The  number  of  comparison  is  A..  We  can  use  a  concatenated  code 
obtained  by  joining  two  codes  together.  Given  a  query  of  weight  Q  with 
weight  Q1  in  the  first  section  and  weight  Q-Qj  in  the  second  section.  We  do 
not  need  to  test  all  tlie  documents  of  weight  D  which  is  greater  than  Q.  We 
need  only  to  tes'  the  documents  of  weight  D  and  with  weight  in  the  first 
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section  where  <  D-Q  +  Q^.  For  a  document  of  weight  D  with 

weight  Dj  on  the  first  section,  we  need  to  make  roughly  (  ^  _q  ) 


comparisons  to  determine  whether  a  document  covers  a  query.  We  assume 
the  probability  of  occurrence  of  each  descriptor  to  be  equal.  The  percentage 
of  documents  of  weight  on  the  first  section  and  D-D^  on  the  second  section 
in  all  documents  of  weight  D  is  ( ^  )  /2P .  Let  0  2>  ®  be  the  total  amount 
of  work  for  testing  all  documents  of  weight  D  by  using  two,  one  codes 
respectively.  Then 


nnr;D— 


(5.40) 


The  probability  of  a  query  with  weight  in  the  first  section  and  Q-Q1  in  the 
second  section  is  approximately  (  q  )  /  2^.  On  the  average  the  percentage 
amount  of  work  for  a  query  of  weight  Q  is 


where  C£s  are  the  coefficients  of  the  terms  in  the  expansion  of  (x^  +  x2)^- 
Generally,  if  we  use  a  concatenated  code  obtained  by  joining  i  BCH 
codes  together.  The  fractional  amount  of  work  required  by  using  a 


-2D  >  2 

concatenated  code  and  a  BCH  cock*  is  t  J  where  D  is  the  weight 

of  the  document  and  the  C'^s  are  the  coefficients  of  the  terms  in  the  expansion 
of  (Xj  +  Xj  +  .  .  .  +  X£  )D. 


The  derivation  of  tius  formular  is  similar  to  the 
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derivation  of  the  formula  for  i  =  2  case.  The  percentage  amount  of  work 
using  1  BCH  codes  comparing  with  one  BCH  code  under  different  weight  of 
documents  is  plotted  in  Fig.  1.  7' he  curves  are  independent  of  the  weight 

of  a  query. 


Weight  of  a  document 


Fig.  1.  The  percentage  amount  of  work  required  by  using  l  BCH 
codes  as  compared  with  that  of  one  BCH  code  for  different 
weights  of  a  document. 
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VI.  CONCLUSIONS  AND  FURTHER  PROBLEMS 
6.  1  Conclusions 

Two  related  classes  of  codes  derived  from  Euclidean  geometries  have 
been  found.  These  codes  can  be  u-step  decoded  by  threshold  decoding. 

These  codes  are  comparable  to  projective  geometry  codes  which  are  moder¬ 
ately  efficient  random -error -correcting  codes  for  practical  values  of  code 
length  and  rate  and  can  be  decoded  with  a  relatively  modest  amount  of  equip¬ 
ment.  Hence  it  appears  that  these  codes  may  be  suxcable  for  use  in  error 
control  systems  requiring  random -error  correction.  The  polynomial  version 
of  Rudolph's  projective  geometry  codes  has  been  found  for  practial  values  of 
code  length  and  rate.  It  is  important  to  find  the  polynomial  version  of  a  cyclic 
code  because  we  need  to  know  the  generator  polynomial  for  encoding  purposes 
and  furthermore  the  number  of  information  digits  can  be  determined  easily 
from  the  generator  polynomial.  One -step  threshold  decoding  using  not 
necessarily  orthogonal  parity  checks  is  found  to  be  applicable  to  BCH  codes 
and  codes  derived  from  Euclidean  geometries.  We  found  that  it  is  possible 
to  improve  upon  Rudolph’s  decoding  methed  in  some  cases.  If  projective 
geometry  codes,  Euclidean  geometry  codes,  and  DCH  codes  are  decoded  by 
one-step  threshold  decoding  methods,  their  efficiency  and  error -correcting 
ability  are  comparable.  Two  new  classes  of  zero -false -drop  codes  have 
been  found  which  compare  favorably  with  the  previously  known  classes. 

Several  results  related  to  the  application  of  algebraic  coding  theory  are  ob¬ 
tained.  They  may  be  useful  in  a  practical  information  retrieval  system. 

6.2  Further  Research  Areas 

Because  of  the  easy  implementation  of  threshold  decoder  and  abundant 
results  on  the  construction  of  threshold-decodable  codes,  it  appears  that 
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threshold  decoding  will  continue  to  be  a  promising  area  of  research  directed 
toward  improving  the  reliability  of  data  transmission  in  communication  systems. 
Some  promising  areas  for  future  research  are: 

1.  To  find  more  powerful  threshold  decoding  algorithms  and  evaluate 
error -correcting  ability  of  these  algorithms  for  cyclic  codes. 

We  have  shown  that  not  all  cyclic  codes  can  be  L-step  orthogonalized. 

The  L-step  orthogonalization  procedure  can  be  generalized  by  allowing  the 
use  of  non -orthogonal  parity  checks  et  each  step.  One -step  decoding  methods 
are  the  only  ones  that  have  been  investigated  by  means  of  their  relation  to 
balanced  incomplete  block  designs.  As  is  well  known,  many  error  patterns 
with  weights  greater  that  £(d-l)/2j  ,  where  d  is  the  minimum  distance  of  the 
code,  can  be  corrected  by  threshold  decoding.  It  would  be  desirable  to  find 
more  powerful  threshold  decoding  algorithms  and  to  evaluate  more  precisely 
error -correcting  ability  of  threshold  decoding  algorithms. 

2.  To  construct  new  codes  suitable  for  threshold  decoding  or  to  improve 
the  threshold  decoders  for  known  codes. 

The  finite  geometry  codes  are  not  as  numerous  as  BCH  codes.  It  is 
easy  to  define  a  class  of  codes  which  contains  the  Euclidean  geometry  codes 
and  projective  geometry  codes  as  subclasses,  as  follows. 

Let  a  be  a  primitive  element  of  GF(qm).  Let  a  be  an  integer  which 

m  h 

divides  q  -1.  Let  C  be  a  code  whose  generator  polynomial  contains  a  n  as 

roots  for  h  which  are  less  than  qm-l,  are  divisible  by  a,  and  satisfy  the 

condition  0  <  w^  (hp-1)  <  I  where  0  <  j  <  s-1  and  I  is  a  fixed  integer.  When 

I  is  a  multiple  of  (q - 1)  and  a  is  equal  to  one,  we  have  a  Euclidean  geometry 

code.  When  I  is  a  multiple  of  q-1  and  a  is  equal  to  q-1,  we  have  a  projective 

geometry  code.  For  other  values  of  I  and  a  we  obtain  codes  that  have  not  been 

investigated  previously.  The  number  of  information  digits  and  the  BCH  bound 
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for  these  codes  can  be  determined  easily.  To  establish  a  decoding  algorithm 
for  these  codes,  possible  by  exploiting  their  geometrical  properties,  would  be 
very  useful. 

Improvement  of  the  decoders  for  finite  geometry  codes  are  possible. 

The  choice  of  u' -dimensional  flat  parity  checks  in  the  implementation  of  u-step 
decoders  is  not  unique.  It  is  quite  possible  that  one  choice  would  lead  to 
simpler  circuitry  than  the  others.  The  number  of  majority  gates  may  Do 
reduced  by  detailed  evaluation  of  the  dependency  of  these  u' -dimensional  flat 
parity  checks.  As  to  one -step  decoders  for  projective  geometry  codes,  we 
have  demonstrated  in  section  3.4  the  possibility  of  using  some  but  not  all 
u -dimensional  flats  for  majority  voting  and  still  achieving  the  same  guaranteed 
decodable  distance  as  Rudolph's  method  does.  The  number  of  parity  checks 
in  the  one -step  decoding  algorithm  for  Euclidean  geometry  codes  and  BCH  codes 
proposed  in  section  4.  2  may  be  more  than  enough.  The  guaranteed  decodable 
distance  1  +  [(qm-l)  /  (qu-l)Jfor  one-step  decoding  of  finite  geometry  codes 
bv  this  decoder  is  only  a  bound  which  may  underestimates  the  error -correcting 
ability  of  the  decoder.  Further  investigation  of  these  questions  may  lead  to 


fruitful  results. 
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13  ABSTR AC  T 


Finding  cyclic  codes  that  can  be  decoded  efficiently  by  threshold  logic  is  im¬ 
portant  because  the  decoders  are  very  easy  to  implement.  Two  related  classes  of 
codes  derived  from  Euclidean  geometries  are  presented.  The  code  length,  number  of 
information  symbols,  and  minimum  distance  are  shown  to  be  related  by  means  of 
parameters  of  a  code.  These  codes  can  be  decoded  with  a  variation  of  the  original 
algorithm  proposed  by  Reed  for  Reed-Muller  codes.  We  show  that  these  codes  are 
comparable  to  Rudolph's  projective  geometry  codes  which  are  known  to  have  the 
following  important  feature.  For  a  given  code  length  and  rate,  the  projective 
geometry  code  has  relatively  large  minimum  distance  and  the  decoder  is  usually 
very  simple.  We  have  deceived  a  class  of  codes  from  projective  geometries  in 
terms  of  the  roots  of  generator  polynomials.  These  codes  are  shown  to  contain 
the  corresponding  non-primitive  Reed-Muller  codes  discovered  by  Weldon  (18)  as 
c-ubcodes,  in  many  cases,  proper  subcodes  with  the  same  error -correct  in.g  ability 
by  L-step  orthogonaiization  procedure.  These  codes  are  found  to  be  identical  to 
Rudolph's  projective  geometry  codes  for  all  useful  parameters  of  the  codes. 
Threshold  decoding  of  BCH  codes  and  the  generality  of  L-step  orthogonaiization 
procedure  to  cyclic  codes  are  discussed.  Investigation  on  the  application  of 
coding  theory  to  information  retrieval  is  presented. 
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